Gene EcolC_1374 details


Gene Information

Locus tag: EcolC_1374
Symbol:
ID: 6068098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1505878
End bp: 1507719
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 57%
IMG OID: 641600796
Product: NADH dehydrogenase subunit L
Protein accession: YP_001724367
Protein GI: 170019413
COG category: [C] Energy production and conversion; [P] Inorganic ion transport and metabolism
COG ID: [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit
TIGRFAM ID: [TIGR01974] proton-translocating NADH-quinone oxidoreductase, chain L
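
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon; the function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1505878
END_BP = 1507719
GENE_LENGTH_BP = 1842
PROTEIN_LENGTH_AA = 613


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length: int) -> int:
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = coding_codons(computed_gene_length)

print(computed_gene_length, computed_protein_length)  # prints: 1842 613
```

Under those assumptions both computed values agree with the Gene Length and Protein Length fields.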


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.497316
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATGC TTGCCTTAAC CATTATTTTG CCATTGATTG GCTTCGTCCT GCTGGCATTC 
TCCCGTGGGC GCTGGTCTGA AAACGTCTCG GCGATCGTCG GCGTAGGCTC TGTGGGCCTG
GCGGCGCTGG TAACCGCCTT TATCGGCGTT GATTTCTTCG CTAACGGCGA GCAGACATAC
AGCCAGCCGC TGTGGACGTG GATGTCGGTA GGCGACTTTA ACATCGGTTT TAACCTGGTG
CTGGACGGCC TGTCGCTGAC CATGCTCTCG GTGGTCACTG GTGTGGGTTT CCTTATTCAC
ATGTACGCCT CCTGGTATAT GCGCGGTGAA GAGGGCTACT CTCGCTTCTT CGCTTACACC
AACCTGTTCA TCGCCAGCAT GGTGGTTCTG GTGCTTGCCG ACAACCTGCT GCTGATGTAC
CTCGGCTGGG AAGGCGTGGG CCTGTGCTCC TATCTGCTGA TCGGGTTCTA TTACACCGAT
CCGAAGAATG GCGCAGCGGC AATGAAAGCG TTCGTCGTGA CCCGTGTGGG TGACGTGTTC
CTCGCTTTCG CACTGTTCAT TCTTTACAAC GAACTGGGCA CCCTGAACTT CCGCGAAATG
GTGGAACTGG CACCAGCGCA CTTTGCTGAC GGCAATAACA TGCTGATGTG GGCGACGCTG
ATGCTGCTGG GCGGTGCGGT CGGTAAATCT GCGCAGTTGC CGTTGCAGAC ATGGCTTGCT
GACGCGATGG CGGGCCCGAC GCCTGTCTCC GCGCTGATCC ACGCCGCAAC GATGGTAACC
GCGGGTGTCT ACCTGATCGC CCGTACCCAC GGCCTGTTCC TGATGACGCC GGAAGTTCTG
CATCTGGTGG GTATTGTCGG GGCGGTTACG CTGCTGCTGG CCGGTTTTGC CGCGCTGGTA
CAGACCGACA TCAAACGTGT TCTCGCTTAC TCTACCATGA GCCAGATTGG CTACATGTTC
CTCGCGCTTG GCGTGCAGGC ATGGGATGCG GCGATTTTCC ACTTGATGAC TCACGCGTTC
TTTAAAGCGC TGCTGTTCCT GGCATCCGGT TCCGTCATTC TGGCCTGCCA TCACGAACAG
AACATCTTCA AGATGGGCGG TCTGCGTAAA TCTATTCCGC TGGTTTATCT CTGCTTCCTG
GTGGGCGGCG CAGCACTGTC GGCACTGCCG CTGGTCACTG CGGGCTTCTT CAGTAAGGAT
GAGATCCTCG CGGGTGCGAT GGCGAATGGT CATATCAATC TGATGGTGGC AGGTCTGGTC
GGTGCGTTTA TGACCTCGCT CTACACCTTC CGTATGATTT TCATCGTCTT CCACGGAAAA
GAACAAATTC ACGCTCACGC CGTGAAAGGG GTAACTCACA GCCTGCCGCT GATTGTGCTG
CTGATCCTTT CCACCTTCGT TGGCGCACTG ATTGTACCGC CGCTGCAGGG CGTGCTTCCG
CAAACGACGG AACTGGCGCA CGGCAGCATG TTGACCCTGG AAATTACCTC TGGCGTGGTC
GCGGTGGTCG GCATTCTGCT GGCAGCCTGG CTGTGGCTGG GTAAACGTAC TCTGGTGACC
TCCATCGCCA ACAGTGCGCC GGGCCGCCTG CTGAGTACCT GGTGGTACAA CGCCTGGGGC
TTTGACTGGC TGTACGACAA AGTGTTCGTC AAGCCGTTCC TGGGTATTGC CTGGTTGCTG
AAACGCGATC CGCTGAACTC AATGATGAAC ATCCCGGCGG TTCTTTCCCG CTTTGCAGGT
AAAGGTCTGC TGTTAAGCGA GAACGGTTAT CTGCGCTGGT ATGTGGCATC CATGAGCATC
GGTGCGGTCG TGGTGCTGGC ACTGTTGATG GTACTGCGTT GA
 
Protein sequence
MNMLALTIIL PLIGFVLLAF SRGRWSENVS AIVGVGSVGL AALVTAFIGV DFFANGEQTY 
SQPLWTWMSV GDFNIGFNLV LDGLSLTMLS VVTGVGFLIH MYASWYMRGE EGYSRFFAYT
NLFIASMVVL VLADNLLLMY LGWEGVGLCS YLLIGFYYTD PKNGAAAMKA FVVTRVGDVF
LAFALFILYN ELGTLNFREM VELAPAHFAD GNNMLMWATL MLLGGAVGKS AQLPLQTWLA
DAMAGPTPVS ALIHAATMVT AGVYLIARTH GLFLMTPEVL HLVGIVGAVT LLLAGFAALV
QTDIKRVLAY STMSQIGYMF LALGVQAWDA AIFHLMTHAF FKALLFLASG SVILACHHEQ
NIFKMGGLRK SIPLVYLCFL VGGAALSALP LVTAGFFSKD EILAGAMANG HINLMVAGLV
GAFMTSLYTF RMIFIVFHGK EQIHAHAVKG VTHSLPLIVL LILSTFVGAL IVPPLQGVLP
QTTELAHGSM LTLEITSGVV AVVGILLAAW LWLGKRTLVT SIANSAPGRL LSTWWYNAWG
FDWLYDKVFV KPFLGIAWLL KRDPLNSMMN IPAVLSRFAG KGLLLSENGY LRWYVASMSI
GAVVVLALLM VLR
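
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below, using only the Python standard library, strips the whitespace, translates the DNA, and compares the result with the listed protein. It uses only the first line of the gene sequence and the first two blocks of the protein sequence as input. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares; the sketch does not model table 11's alternative start codons, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments, listed in T, C, A, G order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate complete codons from position 0; stop codons become '*'."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence and first two protein blocks from this record.
FIRST_GENE_LINE = "ATGAACATGC TTGCCTTAAC CATTATTTTG CCATTGATTG GCTTCGTCCT GCTGGCATTC"
FIRST_PROTEIN_BLOCKS = "MNMLALTIIL PLIGFVLLAF"

dna = clean_sequence(FIRST_GENE_LINE)
peptide = translate(dna)

print(len(dna), peptide)  # prints: 60 MNMLALTIILPLIGFVLLAF
print(peptide == clean_sequence(FIRST_PROTEIN_BLOCKS))  # prints: True
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field, and translating it would be expected to end in `*` for the terminal TGA codon.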