Gene ECH74115_3423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3423 
SymbolnuoF 
ID6967592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3167750 
End bp3169087 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content57% 
IMG OID643387230 
ProductNADH dehydrogenase I subunit F 
Protein accessionYP_002271693 
Protein GI209395949 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACA TTATCCGTAC TCCCGAAACG CATCCGCTGA CCTGGCGTCT GCGCGATGAC 
AAACAGCCAG TGTGGCTGGA CGAATACCGC AGCAAAAACG GTTACGAAGG GGCGCGTAAG
GCGCTGACCG GGCTGTCTCC GGACGAAATC GTTAATCAGG TAAAAGACGC TGGTCTGAAA
GGGCGCGGCG GCGCGGGCTT TTCGACTGGC TTGAAGTGGA GCCTGATGCC GAAAGACGAA
TCCATGAACA TCCGTTACCT GCTGTGTAAC GCCGATGAAA TGGAGCCGGG CACCTATAAA
GACCGCCTGT TGATGGAGCA ACTGCCGCAC CTGCTGGTGG AAGGTATGCT CATCTCCGCG
TTTGCGCTGA AAGCTTACCG TGGCTACATC TTCCTGCGTG GCGAATATAT TGAAGCGGCA
GTAAATCTGC GCCGCGCCAT TGCCGAAGCG ACTGAAGCGG GATTGCTTGG CAAAAACATT
ATGGGAACAG GTTTCGACTT CGAACTGTTC GTCCATACCG GGGCAGGGCG CTACATCTGC
GGGGAAGAAA CAGCGTTAAT CAACTCCCTG GAAGGGCGTC GTGCTAACCC ACGCTCGAAA
CCCCCCTTCC CGGCAACCTC CGGCGTATGG GGCAAACCGA CCTGTGTCAA CAACGTCGAA
ACCCTGTGTA ACGTTCCGGC GATCCTCGCT AACGGCGTGG AGTGGTATCA GAACATCTCG
AAAAGTAAAG ATGCTGGCAC CAAGCTGATG GGCTTCTCCG GTCGGGTGAA AAATCCGGGA
CTGTGGGAAC TGCCGTTCGG CACCACCGCA CGCGAGATCC TCGAAGATTA CGCCGGTGGT
ATGCGTGACG GTCTGAAATT CAAAGCCTGG CAGCCAGGCG GCGCGGGCAC CGACTTCCTG
ACCGAAGCGC ACCTTGACCT GCCGATGGAA TTCGAAAGTA TCGGTAAAGC GGGCAGCCGT
CTGGGTACGG CGCTGGCGAT GGCGGTTGAC CATGAAATCA ACATGGTGTC GCTGGTGCGT
AACCTGGAAG AGTTTTTCGC CCGTGAGTCC TGCGGCTGGT GTACGCCGTG CCGCGACGGT
CTGCCGTGGA GCGTGAAAAT TCTGCGTGCG CTGGAGCGTG GCGAAGGTCA GCCGGGCGAT
ATCGAAACAC TTGAGCAACT GTGTCGATTC TTAGGCCCGG GTAAAACTTT CTGTGCCCAC
GCACCTGGTG CTGTGGAGCC GTTACAGAGC GCCATCAAAT ATTTCCGCGA AGAATTTGAG
GCGGGGATCA AACAGCCGTT CAGCAATACC CATTTGATTA ATGGGATTCA GCCGAACCTG
CTGAAAGAGC GCTGGTAA
 
Protein sequence
MKNIIRTPET HPLTWRLRDD KQPVWLDEYR SKNGYEGARK ALTGLSPDEI VNQVKDAGLK 
GRGGAGFSTG LKWSLMPKDE SMNIRYLLCN ADEMEPGTYK DRLLMEQLPH LLVEGMLISA
FALKAYRGYI FLRGEYIEAA VNLRRAIAEA TEAGLLGKNI MGTGFDFELF VHTGAGRYIC
GEETALINSL EGRRANPRSK PPFPATSGVW GKPTCVNNVE TLCNVPAILA NGVEWYQNIS
KSKDAGTKLM GFSGRVKNPG LWELPFGTTA REILEDYAGG MRDGLKFKAW QPGGAGTDFL
TEAHLDLPME FESIGKAGSR LGTALAMAVD HEINMVSLVR NLEEFFARES CGWCTPCRDG
LPWSVKILRA LERGEGQPGD IETLEQLCRF LGPGKTFCAH APGAVEPLQS AIKYFREEFE
AGIKQPFSNT HLINGIQPNL LKERW