Gene EcolC_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2006 
Symbol 
ID6068085 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2212367 
End bp2213407 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content53% 
IMG OID641601420 
Productputative oxidoreductase 
Protein accessionYP_001724979 
Protein GI170020025 
COG category[R] General function prediction only 
COG ID[COG0673] Predicted dehydrogenases and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00151609 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.380811 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA ACATCCGTGT TGGGTTGATT GGGTATGGTT ATGCGAGCAA AACCTTCCAT 
GCGCCCCTGA TTGCGGGCAC GCCCGGGCTG GAACTGGCGG TAATCTCCAG CAGTGATGAA
ACAAAAGTAA AAGCCGACTG GCCAACGGTT ACGGTTGTCT CTGAGCCGAA GCATCTGTTT
AACGATCCCA ACATAGACCT GATTGTCATT CCTACACCCA ACGATACCCA TTTCCCGTTA
GCCAAAGCGG CGCTTGAGGC GGGTAAACAT GTGGTCGTTG ATAAACCCTT TACCGTGACA
CTGTCACAAG CGCGAGAGCT GGATGCGCTG GCAAAAAGCC TGGGGCGTGT GCTGTCTGTA
TTCCATAACC GTCGCTGGGA TAGCGATTTC TTGACGCTAA AAGGTTTACT CGCGGAAGGC
GTGCTGGGTG AAGTTGCTTA CTTTGAGTCT CATTTTGACC GCTTCCGTCC GCAGGTGCGC
GATCGTTGGC GTGAACAGGG CGGCCCAGGC AGCGGTATCT GGTACGATTT AGCACCACAT
CTTCTTGATC AGGCCATTAC GCTTTTTGGT TTACCGGTCA GCATGACGGT AGATTTGGCA
CAGTTACGGC CCGGAGCGCA GTCGACCGAT TATTTCCACG CCATCTTGTC CTATCCACAG
CGGCGAGTCA TTTTACACGG TACCATGCTG GCAGCCGCTG AGTCAGCACG GTATATCGTG
CATGGATCCC GAGGCAGTTA TGTGAAATAT GGCCTCGATC CACAGGAAGA ACGTCTGAAA
AATGGCGAGC GTCTACCGCA GGAAGACTGG GGCTACGATA TGCGTGATGG CGTACTTACC
CGCGTGGAAG GTGAGGAACG TGTCGAAGAA ACGCTGTTGA CGGTGCCTGG GAATTATCCG
GCTTACTATG CGGCTATTCG TGATGCGTTA AATGGCGATG GTGAAAATCC GGTTCCGGCA
AGCCAGGCAA TCCAGGTAAT GGAGTTGATT GAGCTGGGCA TCGAATCCGC CAAACATCGC
GCGACTTTGT GCCTTGCATG A
 
Protein sequence
MSDNIRVGLI GYGYASKTFH APLIAGTPGL ELAVISSSDE TKVKADWPTV TVVSEPKHLF 
NDPNIDLIVI PTPNDTHFPL AKAALEAGKH VVVDKPFTVT LSQARELDAL AKSLGRVLSV
FHNRRWDSDF LTLKGLLAEG VLGEVAYFES HFDRFRPQVR DRWREQGGPG SGIWYDLAPH
LLDQAITLFG LPVSMTVDLA QLRPGAQSTD YFHAILSYPQ RRVILHGTML AAAESARYIV
HGSRGSYVKY GLDPQEERLK NGERLPQEDW GYDMRDGVLT RVEGEERVEE TLLTVPGNYP
AYYAAIRDAL NGDGENPVPA SQAIQVMELI ELGIESAKHR ATLCLA