Gene EcolC_3044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3044 
Symbol 
ID6065944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3323759 
End bp3324847 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content55% 
IMG OID641602460 
Producthypothetical protein 
Protein accessionYP_001725995 
Protein GI170021041 
COG category[C] Energy production and conversion 
COG ID[COG0371] Glycerol dehydrogenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCACA ATCCTATCCG CGTGGTCGTC GGCCCGGCTA ACTACTTTTC ACATCCAGGA 
AGTTTCAATC ACCTGCACGA TTTTTTCACT GATGAACAAC TTTCTCGCGC GGTGTGGATC
TACGGCAAAC GCGCCATTGC TGCGGCGCAA ACCAAACTTC CGCCAGCGTT TGGACTGCCA
GGGGCAAAGC ATATTTTGTT TCGCGGTCAT TGCAGCGAAA GCGATGTACA ACAACTGGCG
GCTGAGTCCG GTGACGACCG CAGCGTGGTG ATTGGCGTCG GTGGCGGTGC ACTGCTCGAC
ACCGCGAAAG CCCTCGCCCG CCGTCTCGGT CTGCCGTTTG TTGCCGTTCC GACGATCGCC
GCCACCTGCG CCGCCTGGAC ACCGCTCTCC GTCTGGTATA ATGATGCCGG ACAGGCGCTG
CATTATGAGA TTTTCGACGA CGCCAATTTT ATGGTGCTGG TGGAACCGGA GATTATCCTC
AATGCACCGC AACAATATCT GCTGGCGGGG ATCGGTGACA CGCTGGCGAA ATGGTATGAA
GCGGTGGTGC TGGCTCCGCA ACCAGAAACG TTGCCGCTAA CCGTGCGACT GGGGATCAAT
AATGCGCAAG CCATTCGCGA CGTCTTGTTA AACAGTAGCG AACAGGCGCT GAGCGATCAG
CAAAATCAAC AGTTAACGCA ATCATTTTGC GATGTGGTGG ATGCTATTAT TGCTGGTGGT
GGGATGGTTG GTGGTCTGGG CGATCGTTTT ACGCGTGTGG CGGCAGCTCA TGCCGTGCAT
AACGGTCTGA CCGTGCTGCC GCAAACCGAG AAGTTTCTCC ACGGCACCAA AGTCGCCTAC
GGAATTCTGG TGCAAAGCGC CTTGCTGGGT CAGGATGATG TGCTGGCGCA ATTAACTGGA
GCGTATCAGC GTTTTCATCT GCCGACTACA CTGGCGGAGC TGGAAGTGGA TATCAATAAT
CAGGCGGAGA TCGACAAAGT GATTGCCCAC ACCCTGCGTC CGGTGGAGTC CATTCATTAC
CTGCCAGTCA CGCTGACACC AGATACGTTG CGTGCAGCGT TCAAAAAAGT GGAATCGTTT
AAAGCCTGA
 
Protein sequence
MPHNPIRVVV GPANYFSHPG SFNHLHDFFT DEQLSRAVWI YGKRAIAAAQ TKLPPAFGLP 
GAKHILFRGH CSESDVQQLA AESGDDRSVV IGVGGGALLD TAKALARRLG LPFVAVPTIA
ATCAAWTPLS VWYNDAGQAL HYEIFDDANF MVLVEPEIIL NAPQQYLLAG IGDTLAKWYE
AVVLAPQPET LPLTVRLGIN NAQAIRDVLL NSSEQALSDQ QNQQLTQSFC DVVDAIIAGG
GMVGGLGDRF TRVAAAHAVH NGLTVLPQTE KFLHGTKVAY GILVQSALLG QDDVLAQLTG
AYQRFHLPTT LAELEVDINN QAEIDKVIAH TLRPVESIHY LPVTLTPDTL RAAFKKVESF
KA