Gene EcolC_3214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3214 
Symbol 
ID6066697 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3521512 
End bp3522486 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content54% 
IMG OID641602629 
Productaldo/keto reductase 
Protein accessionYP_001726163 
Protein GI170021209 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000166082 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAATACA ACCCCTTAGG AAAAACCGAC CTTCGCGTTT CCCGACTTTG CCTCGGCTGT 
ATGACCTTTG GCGAGCCAGA TCGTGGTAAT CACGCATGGA CACTGCCGGA AGAAAGCAGC
CGTCCCATAA TTAAACGCGC GCTGGAAGGC GGCATAAATT TCTTTGACAC CGCCAATAGC
TATTCCGATG GCAGCAGCGA AGAGATCGTC GGTCGCGCAC TGCGGGATTT CGCCCGTCGT
GAAGACGTGG TCGTTGCCAC CAAAGTATTC CATCGCGTTG GTGATTTGCC GGAAGGATTA
TCCCGTGCGC AAATATTGCG CTCTATCGAC GACAGCCTGC GCCGTCTCGG CATGGATTAT
GTCGATATCC TGCAAATTCA TCGCTGGGAT TACAACACGC CGATCGAAGA GACGCTGGAA
GCCCTGAACG ACGTGGTAAA AGCCGGGAAA GCGCGTTATA TCGGCGCGTC ATCAATGCAC
GCTTCGCAGT TTGCTCAGGC ACTGGAACTC CAAAAACAGC ATGGCTGGGC GCAGTTTGTC
AGTATGCAGG ATCACTACAA TCTGATTTAT CGCGAAGAAG AGCGCGAGAT GCTGCCACTG
TGTTATCAGG AGGGCGTGGC GGTGATTCCG TGGAGCCCGC TGGCGCGGGG GCGACTGACG
CGTCCGTGGG GAGAAACTAC CGCACGACTG GTGTCTGATG AGGTGGGGAA AAATCTCTAT
AAAGAAAGCG ATGAAAATGA CGCGCAGATC GCAGAGCGGT TAACGGGCGT CAGTGAAGAA
CTCGGTGCAA CACGAGCACA AGTTGCGCTG GCCTGGTTGT TGAGTAAACC GGGCATTGCC
GCACCGATTA TCGGTACATC GCGGGAAGAA CAGCTTGATG AGCTATTGAA CGCGGTGGAT
ATCACTTTGA AGCCGGAACA GATTGCCGAA CTGGAAACGC CGTATAAACC GCATCCGGTA
GTAGGATTTA AATAA
 
Protein sequence
MQYNPLGKTD LRVSRLCLGC MTFGEPDRGN HAWTLPEESS RPIIKRALEG GINFFDTANS 
YSDGSSEEIV GRALRDFARR EDVVVATKVF HRVGDLPEGL SRAQILRSID DSLRRLGMDY
VDILQIHRWD YNTPIEETLE ALNDVVKAGK ARYIGASSMH ASQFAQALEL QKQHGWAQFV
SMQDHYNLIY REEEREMLPL CYQEGVAVIP WSPLARGRLT RPWGETTARL VSDEVGKNLY
KESDENDAQI AERLTGVSEE LGATRAQVAL AWLLSKPGIA APIIGTSREE QLDELLNAVD
ITLKPEQIAE LETPYKPHPV VGFK