Gene EcolC_0991 details


Gene Information

Locus tag: EcolC_0991
Symbol:
ID: 6067744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1077591
End bp: 1079300
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 57%
IMG OID: 641600399
Product: NADH dehydrogenase (ubiquinone) 30 kDa subunit
Protein accession: YP_001723987
Protein GI: 170019033
COG category: [C] Energy production and conversion
COG ID: [COG3261] Ni,Fe-hydrogenase III large subunit
        [COG3262] Ni,Fe-hydrogenase III component G
TIGRFAM ID:
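
The start, end, gene length and protein length fields are related by simple arithmetic. The sketch below shows that relationship under two assumptions that are not stated in the record: the coordinates are 1-based and inclusive, and the gene length includes the stop codon. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(1077591, 1079300)
    print(length)                     # 1710
    print(protein_length_aa(length))  # 569
```

Both values agree with the Gene Length and Protein Length fields of the record.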


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0591467
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGAAG AAAAATTAGG TCAACATTAT CTCGCCGCGC TGAATGAGGC ATTTCCGGGC 
GTCGTGCTGG ACCACGCCTG GCAGACCAAA GATCAGCTGA CTGTCACCGT AAAGGTGAAC
TACCTGCCGG AAGTGGTGGA GTTTCTTTAC TACAAACAGG GTGGCTGGCT GTCGGTGCTG
TTTGGTAACG ACGAACGCAA ACTGAATGGT CATTACGCCG TTTACTACGT GCTGTCGATG
GAGAAGGGCA CTAAGTGTTG GATTACGGTT CGCGTCGAAG TTGACGCCAA CAAACCGGAA
TATCCGTCCG TGACGCCGCG CGTTCCGGCG GCGGTGTGGG GCGAGCGTGA AGTGCGCGAT
ATGTACGGTT TGATTCCGGT TGGTCTGCCG GATGAACGTC GTCTGGTGCT GCCGGATGAC
TGGCCGGATG AACTTTATCC GCTGCGTAAA GACAGCATGG ATTATCGTCA GCGTCCGGCA
CCGACCACCG ATGCTGAAAC CTACGAGTTC ATCAACGAAC TGGGCGACAA GAAAAACAAC
GTCGTGCCGA TTGGTCCGCT GCACGTCACT TCTGATGAAC CGGGCCACTT CCGTCTGTTC
GTCGATGGCG AAAACATTAT CGACGCCGAC TACCGTCTGT TCTACGTCCA TCGCGGCATG
GAAAAACTGG CGGAAACCCG TATGGGTTAT AACGAAGTGA CCTTCCTCTC TGACCGTGTG
TGCGGGATCT GCGGCTTTGC CCACAGCACC GCCTACACCA CGTCGGTGGA AAACGCGATG
GGTATTCAGG TGCCAGAACG TGCGCAGATG ATCCGCGCCA TTCTGCTGGA GGTAGAACGC
TTGCACTCGC ATCTGCTCAA CCTTGGCCTG GCCTGTCACT TTACCGGCTT CGACTCCGGC
TTTATGCAGT TCTTCCGCGT GCGTGAAACC TCCATGAAAA TGGCAGAGAT CCTTACCGGT
GCGCGTAAAA CCTACGGCCT GAACTTGATC GGCGGGATTC GTCGCGATCT GCTGAAAGAC
GACATGATCC AGACCCGCCA GCTGGCACAA CAGATGCGTC GTGAAGTGCA GGAGCTGGTG
GATGTGCTGC TGAGCACTCC GAACATGGAA CAGCGCACTG TCGGCATTGG TCGTCTGGAC
CCGGAAATCG CTCGCGACTT CAGTAACGTC GGCCCGATGG TCCGTGCCAG CGGTCACGCC
CGTGATACCC GCGCCGATCA CCCGTTTGTC GGCTATGGCC TGCTGCCAAT GGAAGTCCAC
AGCGAGCAGG GCTGCGACGT TATTTCCCGT CTGAAAGTGC GTATCAACGA AGTCTATACC
GCGCTGAACA TGATCGACTA CGGTCTGGAT AACCTGCCGG GTGGCCCACT GATGGTGGAA
GGCTTTACCT ACATTCCGCA CCGCTTTGCG CTGGGCTTTG CCGAAGCGCC GCGCGGCGAT
GATATCCACT GGAGCATGAC CGGCGACAAC CAGAAGCTGT ACCGCTGGCG CTGCCGTGCC
GCGACCTACG CGAACTGGCC GACCCTGCGC TACATGCTGC GCGGCAACAC CGTTTCCGAT
GCGCCGCTGA TTATCGGTAG CCTCGACCCT TGCTACTCCT GTACCGACCG CATGACCGTG
GTCGATGTGC GTAAGAAGAA GAGCAAAGTG GTGCCGTACA AAGAACTCGA GCGTTACAGC
ATTGAGCGTA AAAACTCGCC GCTGAAATAA
 
Protein sequence
MSEEKLGQHY LAALNEAFPG VVLDHAWQTK DQLTVTVKVN YLPEVVEFLY YKQGGWLSVL 
FGNDERKLNG HYAVYYVLSM EKGTKCWITV RVEVDANKPE YPSVTPRVPA AVWGEREVRD
MYGLIPVGLP DERRLVLPDD WPDELYPLRK DSMDYRQRPA PTTDAETYEF INELGDKKNN
VVPIGPLHVT SDEPGHFRLF VDGENIIDAD YRLFYVHRGM EKLAETRMGY NEVTFLSDRV
CGICGFAHST AYTTSVENAM GIQVPERAQM IRAILLEVER LHSHLLNLGL ACHFTGFDSG
FMQFFRVRET SMKMAEILTG ARKTYGLNLI GGIRRDLLKD DMIQTRQLAQ QMRREVQELV
DVLLSTPNME QRTVGIGRLD PEIARDFSNV GPMVRASGHA RDTRADHPFV GYGLLPMEVH
SEQGCDVISR LKVRINEVYT ALNMIDYGLD NLPGGPLMVE GFTYIPHRFA LGFAEAPRGD
DIHWSMTGDN QKLYRWRCRA ATYANWPTLR YMLRGNTVSD APLIIGSLDP CYSCTDRMTV
VDVRKKKSKV VPYKELERYS IERKNSPLK
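
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks must be removed before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates with the amino acid assignments of translation table 11 (which match the standard code). The helper names are hypothetical, and the codon table is built from the conventional TCAG ordering.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (first base varies slowest, third fastest).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence shown above.
    first_line = ("ATGTCTGAAG AAAAATTAGG TCAACATTAT "
                  "CTCGCCGCGC TGAATGAGGC ATTTCCGGGC")
    dna = clean_sequence(first_line)
    print(len(dna))        # 60
    print(translate(dna))  # MSEEKLGQHYLAALNEAFPG
    print(round(gc_fraction(dna), 2))
```

The translated fragment matches the first 20 residues of the protein sequence. Running the same functions on the full gene sequence is one way to check it against the protein sequence and the GC content field.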