Gene EcHS_A2433 details


Gene Information

Locus tag: EcHS_A2433
Symbol: nuoF
ID: 5592078
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2441789
End bp: 2443126
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 56%
IMG OID: 640921556
Product: NADH dehydrogenase I subunit F
Protein accession: YP_001459090
Protein GI: 157161772
COG category: [C] Energy production and conversion
COG ID: [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit
TIGRFAM ID: [TIGR01959] NADH-quinone oxidoreductase, F subunit
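
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes (the record does not state this) that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the reported protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 2441789
end_bp = 2443126

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the result agrees with the reported Gene Length (1338 bp) and Protein Length (445 aa).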


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAACA TTATCCGTAC TCCCGAAACG CATCCGCTGA CCTGGCGTCT GCGCGATGAC 
AAACAGCCAG TGTGGCTGGA CGAATACCGC AGCAAAAACG GTTACGAAGG GGCGCGTAAG
GCGCTGACCG GGCTGTCTCC GGACGAAATC GTTAATCAGG TAAAAGACGC TGGTCTGAAA
GGGCGCGGCG GCGCGGGCTT TTCGACTGGC CTGAAGTGGA GCCTGATGCC GAAAGACGAA
TCCATGAACA TCCGTTACCT GCTGTGTAAC GCCGATGAAA TGGAGCCGGG CACCTATAAA
GACCGCCTGT TGATGGAGCA ACTGCCGCAC CTGCTGGTGG AAGGTATGCT CATCTCCGCG
TTTGCGCTGA AAGCTTACCG TGGCTACATC TTCCTGCGTG GCGAATATAT CGAAGCGGCA
GTAAATCTGC GCCGCGCCAT TGCCGAAGCC ACTGAAGCGG GCTTGCTTGG CAAAAACATT
ATGGGAACAG GTTTCGACTT CGAACTGTTC GTCCATACCG GGGCAGGGCG CTATATCTGC
GGGGAAGAAA CAGCGTTAAT CAACTCCCTG GAAGGGCGTC GTGCTAACCC ACGCTCGAAA
CCCCCCTTCC CGGCAACCTC CGGCGTATGG GGCAAACCGA CCTGTGTCAA CAACGTCGAA
ACCCTGTGTA ACGTTCCGGC GATCCTCGCT AACGGCGTGG AGTGGTATCA GAACATCTCG
AAAAGTAAAG ATGCTGGCAC CAAGCTGATG GGCTTCTCCG GTCGGGTGAA AAATCCGGGA
CTGTGGGAAC TGCCGTTTGG TACTACCGCG CGCGAGATCC TCGAAGATTA CGCCGGTGGT
ATGCGTGACG GTCTGAAATT CAAAGCCTGG CAGCCAGGCG GCGCGGGCAC CGACTTCCTG
ACCGAAGCGC ACCTTGACCT GCCGATGGAA TTCGAAAGTA TCGGTAAAGC GGGAAGCCGT
CTGGGTACGG CGCTGGCGAT GGCGGTTGAC CATGAGATCA ACATGGTGTC GCTGGTGCGT
AACCTGGAAG AGTTTTTCGC CCGTGAGTCC TGCGGCTGGT GTACGCCGTG CCGCGACGGT
CTGCCGTGGA GCGTGAAAAT TCTGCGTGCG CTGGAGCGTG GTGAAGGTCA GCCGGGCGAT
ATCGAAACAC TTGAGCAACT GTGTCGATTC TTAGGCCCGG GTAAAACTTT CTGTGCCCAC
GCACCTGGTG CAGTGGAGCC GTTACAGAGC GCCATCAAAT ATTTCCGCGA AGAATTTGAG
GCGGGAATCA AACAGCCGTT CAGCAATACC CATTTGATTA ATGGGATTCA GCCGAACCTG
CTGAAAGAGC GCTGGTAA
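
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch; the helper name `gc_content` is hypothetical, and the record does not say how the database rounds its reported percentage. It strips the spaces and line breaks used in the listing before counting.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Demonstration on the last line of the gene sequence only, not the whole gene.
last_line = "CTGAAAGAGC GCTGGTAA"
print(gc_content(last_line))
```

Passing the full gene sequence as a single string would give the value to compare with the reported figure.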
 
Protein sequence
MKNIIRTPET HPLTWRLRDD KQPVWLDEYR SKNGYEGARK ALTGLSPDEI VNQVKDAGLK 
GRGGAGFSTG LKWSLMPKDE SMNIRYLLCN ADEMEPGTYK DRLLMEQLPH LLVEGMLISA
FALKAYRGYI FLRGEYIEAA VNLRRAIAEA TEAGLLGKNI MGTGFDFELF VHTGAGRYIC
GEETALINSL EGRRANPRSK PPFPATSGVW GKPTCVNNVE TLCNVPAILA NGVEWYQNIS
KSKDAGTKLM GFSGRVKNPG LWELPFGTTA REILEDYAGG MRDGLKFKAW QPGGAGTDFL
TEAHLDLPME FESIGKAGSR LGTALAMAVD HEINMVSLVR NLEEFFARES CGWCTPCRDG
LPWSVKILRA LERGEGQPGD IETLEQLCRF LGPGKTFCAH APGAVEPLQS AIKYFREEFE
AGIKQPFSNT HLINGIQPNL LKERW
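
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the usual TCAG-ordered amino-acid string, which translation tables 1 and 11 share. The function name `translate` is hypothetical, and the sketch assumes the listed gene sequence is the coding strand read in frame from its first base.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Tables 1 and 11 share these assignments; they differ in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence; each full line holds 20 whole codons.
first_line = "ATGAAAAACA TTATCCGTAC TCCCGAAACG CATCCGCTGA CCTGGCGTCT GCGCGATGAC"
last_line = "CTGAAAGAGC GCTGGTAA"
print(translate(first_line))  # MKNIIRTPETHPLTWRLRDD
print(translate(last_line))   # LKERW
```

Both fragments match the start and end of the listed protein sequence. Because every full line of the gene sequence is 60 bases long, each line can be translated on its own without shifting the reading frame.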