Gene EcolC_1368 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1368 
Symbol 
ID6068134 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1499369 
End bp1500706 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content56% 
IMG OID641600790 
ProductNADH dehydrogenase I subunit F 
Protein accessionYP_001724361 
Protein GI170019407 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.104051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.821975 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA TTATCCGTAC TCCCGAAACG CATCCGCTGA CCTGGCGTCT GCGCGATGAC 
AAACAGCCAG TGTGGCTGGA CGAATATCGC AGCAAAAACG GTTACGAAGG CGCGCGTAAG
GCGCTGACCG GGCTGTCTCC GGACGAAATC GTTAATCAGG TAAAAGACGC TGGTCTGAAA
GGGCGCGGTG GCGCGGGCTT TTCTACCGGC CTGAAATGGA GCCTGATGCC GAAAGACGAA
TCCATGAACA TCCGTTACCT GCTGTGTAAC GCCGATGAAA TGGAGCCAGG CACCTATAAA
GACCGCCTGC TGATGGAGCA ACTGCCGCAC CTGCTGGTGG AAGGTATGCT CATCTCCGCG
TTTGCGCTGA AAGCTTACCG TGGCTACATC TTCCTGCGTG GCGAATATAT CGAAGCGGCA
GTAAATCTGC GCCGCGCCAT TGCCGAAGCC ACCGAAGCAG GTCTGCTTGG CAAAAATATT
ATGGGAACAG GTTTTGACTT CGAACTGTTC GTCCATACCG GGGCAGGGCG CTACATCTGC
GGGGAAGAAA CAGCGTTAAT CAACTCCCTG GAAGGGCGTC GTGCTAACCC ACGCTCGAAA
CCCCCCTTCC CGGCAACCTC CGGCGTATGG GGCAAACCGA CCTGTGTCAA CAACGTCGAA
ACCCTGTGTA ACGTTCCGGC GATCCTCGCT AACGGCGTGG AGTGGTATCA GAACATCTCG
AAAAGTAAAG ATGCTGGCAC CAAGCTGATG GGCTTCTCCG GTCGGGTGAA AAATCCGGGA
CTGTGGGAAC TGCCGTTCGG CACCACCGCA CGCGAGATCC TCGAAGATTA CGCCGGTGGT
ATGCGTGATG GTCTGAAATT TAAAGCCTGG CAGCCAGGCG GCGCGGGGAC TGACTTCCTG
ACCGAAGCGC ACCTTGATCT GCCGATGGAA TTCGAAAGTA TCGGTAAAGC GGGCAGCCGT
CTGGGTACGG CGCTGGCGAT GGCGGTTGAC CATGAGATCA ACATGGTGTC GCTGGTGCGT
AACCTGGAAG AGTTTTTCGC CCGTGAGTCC TGCGGCTGGT GTACGCCGTG CCGCGACGGT
CTGCCGTGGA GCGTGAAAAT TCTGCGTGCG CTGGAGCGTG GTGAAGGTCA GCCGGGCGAT
ATCGAAACAC TTGAGCAACT GTGTCGATTC TTAGGCCCGG GTAAAACTTT CTGTGCCCAC
GCACCTGGTG CAGTGGAGCC GTTACAGAGC GCCATCAAAT ATTTCCGCGA AGAATTTGAG
GCGGGAATCA AACAGCCGTT CAGCAATACC CATTTGATTA ATGGGATTCA GCCGAACCTG
CTGAAAGAGC GCTGGTAA
 
Protein sequence
MKNIIRTPET HPLTWRLRDD KQPVWLDEYR SKNGYEGARK ALTGLSPDEI VNQVKDAGLK 
GRGGAGFSTG LKWSLMPKDE SMNIRYLLCN ADEMEPGTYK DRLLMEQLPH LLVEGMLISA
FALKAYRGYI FLRGEYIEAA VNLRRAIAEA TEAGLLGKNI MGTGFDFELF VHTGAGRYIC
GEETALINSL EGRRANPRSK PPFPATSGVW GKPTCVNNVE TLCNVPAILA NGVEWYQNIS
KSKDAGTKLM GFSGRVKNPG LWELPFGTTA REILEDYAGG MRDGLKFKAW QPGGAGTDFL
TEAHLDLPME FESIGKAGSR LGTALAMAVD HEINMVSLVR NLEEFFARES CGWCTPCRDG
LPWSVKILRA LERGEGQPGD IETLEQLCRF LGPGKTFCAH APGAVEPLQS AIKYFREEFE
AGIKQPFSNT HLINGIQPNL LKERW