Gene EcolC_1192 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1192 
Symbol 
ID6065531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1306492 
End bp1307844 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content56% 
IMG OID641600608 
Producthydrogenase 4 subunit D 
Protein accessionYP_001724186 
Protein GI170019232 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGGTTT TGTTCGCCGC ACTGACCACG CTGTGCATGT TGTCGCTGAT CTCCGCGTTT 
TATCAGGCCG ATAAAGTTGC CGTCACGTTG ACGTTGGTCA ACGTGGGGGA TGTGGCATTG
TTTGGCCTGG TCATTGATCG CGTGAGTACG CTGATTCTGT TTGTGGTGGT GTTTCTCGGT
TTGCTGGTCA CGATCTACTC CACGGGTTAT CTGACGGATA AAAATCGCGA ACACCCGCAT
AACGGCACGA ATCGTTATTA CGCATTTTTG CTGGTGTTTA TCGGCGCGAT GGCGGGACTG
GTACTCTCCT CGACGCTGCT CGGTCAGTTG TTGTTTTTTG AAATTACGGG CGGCTGCTCC
TGGGCGTTGA TCAGTTATTA CCAGAGCGAT AAAGCGCAGC GTTCAGCACT AAAAGCGTTA
CTTATCACTC ATATCGGCTC GTTGGGGTTG TATCTTGCCG CCGCCACGCT GTTTTTGCAG
ACCGGAACGT TTGCGCTTAG CGCGATGAGC GAGTTACACG GCGACGCACG TTATCTGGTT
TATGGCGGCA TCCTGTTTGC CGCGTGGGGG AAATCGGCCC AGCTACCGAT GCAAGCGTGG
CTACCGGACG CAATGGAAGC GCCAACACCG ATCAGCGCCT ATCTCCACGC CGCATCGATG
GTGAAAGTGG GCGTTTACAT TTTTGCCCGC GCCATTATCG ACGGCGGCAA TATCCCGCAT
GTGATTGGCG GCGTTGGCAT GGTCATGGCC CTGGTCACCA TTCTTTATGG ATTTCTGATG
TATTTGCCAC AGCAGGATAT GAAGCGGTTG CTGGCCTGGT CGACCATCAC TCAACTTGGC
TGGATGTTCT TCGGCTTGTC GCTCTCCATC TTCGGCTCGC GGCTGGCGCT GGAGGGTAGC
ATCGCCTACA TCGTCAACCA CGCGTTCGCT AAAAGCCTGT TTTTCCTTGT AGCAGGTGCG
CTGAGTTACA GCTGCGGCAC GCGCTTGTTG CCGCGTCTGC GTGGCGTATT GCACACCCTG
CCGTTGCCTG GCGTGGGTTT CTGCGTGGCA GCGCTGGCAA TTACCGGCGT GCCGCCGTTC
AACGGCTTCT TCAGTAAATT CCCGCTGTTT GCTGCCGGTT TTGCGTTGTC AGTGGAGTAC
TGGATCCTGC TGCCCGCCAT GATTCTGCTG ATGATTGAAT CGGTCGCCAG TTTCGCCTGG
TTTATTCGCT GGTTTGGTCG CGTCGTGCCT GGCAAACCGA GCGAGGCCGT CGCCGATGCC
GCACCGCTGC CAGGGTCAAT GCGCCTGGTG TTGATTGTAC TGATTGTGAT GTCGCTGATT
TCCAGCGTAA TCGCCGCGAC CTGGTTGCAG TAA
 
Protein sequence
MGVLFAALTT LCMLSLISAF YQADKVAVTL TLVNVGDVAL FGLVIDRVST LILFVVVFLG 
LLVTIYSTGY LTDKNREHPH NGTNRYYAFL LVFIGAMAGL VLSSTLLGQL LFFEITGGCS
WALISYYQSD KAQRSALKAL LITHIGSLGL YLAAATLFLQ TGTFALSAMS ELHGDARYLV
YGGILFAAWG KSAQLPMQAW LPDAMEAPTP ISAYLHAASM VKVGVYIFAR AIIDGGNIPH
VIGGVGMVMA LVTILYGFLM YLPQQDMKRL LAWSTITQLG WMFFGLSLSI FGSRLALEGS
IAYIVNHAFA KSLFFLVAGA LSYSCGTRLL PRLRGVLHTL PLPGVGFCVA ALAITGVPPF
NGFFSKFPLF AAGFALSVEY WILLPAMILL MIESVASFAW FIRWFGRVVP GKPSEAVADA
APLPGSMRLV LIVLIVMSLI SSVIAATWLQ