Gene Dret_1139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDret_1139 
Symbol 
ID8418966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfohalobium retbaense DSM 5692 
KingdomBacteria 
Replicon accessionNC_013223 
Strand
Start bp1335301 
End bp1336608 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content60% 
IMG OID645037713 
ProductPhenylacetate--CoA ligase 
Protein accessionYP_003198005 
Protein GI258405263 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1541] Coenzyme F390 synthetase 
TIGRFAM ID[TIGR02155] phenylacetate-CoA ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000376262 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTACG ATGTGGAAAA GGAAACCTTG CCCCGCGAAG AACTCGAAGC ATTGCAATTG 
CGCAAGTTGC AGTCCCTCGT GGAGCGCGTC TACTACAACG TGCCCTTCTA TCGCGAAAAA
CTGGATGAAG CCGGAGTCAG GCCCGAAGAC ATCCGCAGCC TGCGGGATGT ACAATTCCTC
CCGTTTACCG AAAAGCAGGA TCTGCGCAAC AATTACCCCT TTGGCCTCTT TGCCGTCCCC
CGCGAAAATG TCGTCCGCAT CCACGCCTCT TCCGGAACGA CTGGCAAGGC CACTGTCGTC
GGCTATACCC AGCGCGATGT CCGCAATTGG GCCACGCTCA TGGCCCGTTC GCTGATGGCC
GCCGGAGCGA CACGGCGGGA CACCGTGCAC AACGCTTACG GCTACGGGTT GTTTACCGGC
GGCCTGGGCG TCCATTACGG CGCGGAACAA CTCGGGGCCT CTATCGTGCC CATTTCAGGC
GGCGGCACCA AGCGCCAGGC CACCCTGCTC AAGGACTTCG GCCCCACTGT CATCTGCTGC
ACCCCTTCCT ACGCCCTGCA TCTCTATGAA ACGGCCAAGG CCGGTGGCCT GGATGTTGAA
AATCTTCCCC TGCACACCGG AATCTTCGGC GCCGAACCTT GGACCGACGA AATGCGAGCC
GACCTCGAAT CCAAGCTGGG CATCAAGGCC TTGGACATCT ATGGTCTGTC CGAGATCATG
GGACCCGGCG TCGCCATGGA ATGCCGCCCC GCCCAGGACG GGCTGCATAT CTGGGAAGAC
CACTTCCTGG TCGAGACCAT CGATCCCGAA ACCGGGGAAC AGCTTGCTCC CGGGGAGACC
GGGGAATTGG TGATCACGAC GCTGTCCAAG GAGGCGCAGC CGCTCCTGCG CTACCGCACC
CGGGACCTGA CCCGCCTGAA CACCGTCCCC TGCCGCTGCG GGCGGACCCA TACCCGTATG
GCCCGGGTCA TGGGACGCAG TGACGACATG CTCATCATCC GTGGCGTGAA CGTCTTTCCG
TCGCAAATCG AGAGCATCCT GCTCGAAACC GAGGGCATTG CGCCGCACTA CCAGCTCATC
CTGCGCCGTC ACGGTTCGCT GGACACACTT GAAATCCACG TCGAAATCGA CGATTCCACT
TTCTCCGACG AGATCAAGCA TTTACAACGT CTTGAGCGCA AGATACAGAA AAACATCAAA
GAGTTCCTGG GTGTGACCGC GGATATCAAG CTCGCCGAAC CGATGAGTAT CGCCCGCTCC
CAGGGCAAAG CCCAGCGGAT TATCGATCGT CGCCACGAAG CCGAATAA
 
Protein sequence
MLYDVEKETL PREELEALQL RKLQSLVERV YYNVPFYREK LDEAGVRPED IRSLRDVQFL 
PFTEKQDLRN NYPFGLFAVP RENVVRIHAS SGTTGKATVV GYTQRDVRNW ATLMARSLMA
AGATRRDTVH NAYGYGLFTG GLGVHYGAEQ LGASIVPISG GGTKRQATLL KDFGPTVICC
TPSYALHLYE TAKAGGLDVE NLPLHTGIFG AEPWTDEMRA DLESKLGIKA LDIYGLSEIM
GPGVAMECRP AQDGLHIWED HFLVETIDPE TGEQLAPGET GELVITTLSK EAQPLLRYRT
RDLTRLNTVP CRCGRTHTRM ARVMGRSDDM LIIRGVNVFP SQIESILLET EGIAPHYQLI
LRRHGSLDTL EIHVEIDDST FSDEIKHLQR LERKIQKNIK EFLGVTADIK LAEPMSIARS
QGKAQRIIDR RHEAE