Gene EcSMS35_0620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0620 
Symbol 
ID6144000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp633243 
End bp634331 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content56% 
IMG OID641615512 
Producthypothetical protein 
Protein accessionYP_001742718 
Protein GI170684067 
COG category[C] Energy production and conversion 
COG ID[COG0371] Glycerol dehydrogenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.505795 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTCACA ATCCTATCCG CGTGGTCGTC GGCCCGGCTA ACTACTTTTC ACATCCCGGA 
AGTTTCAATC ACCTGCACGA TTTTTTCACT GATGAACAAC TTTCTCGCGC GGCGTGGATC
TACGGCGAAC GCGCCATTGC TGCGGCGCAA ACAAAACTTC CGCCAGCGTT TGAACTGCCA
GGGGCAAAGC ATATTTTGTT TCGTGGTCAT TGCAGCGAAA GCGATGTACA ACAACTGGCG
GCTGAGTCCG GTGACGATCG CAGCGTGGTG ATTGGCGTCG GCGGCGGCGC ACTGCTCGAC
ACCGCGAAAG CCCTCGCCCG CCGTCTCGGT CTGCCGTTTG TTGCCGTTCC GACGATCGCC
GCTACCTGCG CAGCCTGGAC ACCGCTCTCT GTCTGGTACA ACGATGCCGG ACAAGCGCTG
CATTATGAGA TTTTCGACGA CGCCAATTTT ATGGTGCTGG TGGAACCGGA GATTATCCTC
AACGCGCCGC AAGAATATCT GCTGGCGGGG ATCGGTGACA CGCTGGCGAA ATGGTATGAA
GCGGTGGTGC TGGCCCCGCA ACCAGAAACG TTGCCGTTAA CCGTGCGGCT GGGGATTAAT
AACGCGCAGG CCATTCGCGA CGTCTTGTTA AACAGTAGCG AACAGGCGCT TGCCGATCAG
CAAAATCACC AGTTAACGCA ATCATTTTGC GATGTGGTGG ATGCCATTAT TGCTGGTGGT
GGGATGGTTG GTGGTCTGGG CGATCGTTTT ACGCGTGTGG CGGCAGCTCA TGCCGTGCAT
AACGGTCTGA CCGTGCTGCC GCAAACCGAG AAGTTTCTCC ACGGCACCAA AGTCGCCTAC
GGAATTCTGG TGCAAAGCGC CTTGCTGGGT CAGGATGATG TGCTGGCGCA ATTAACTGGA
GCGTATCAGC GTTTTCATCT GCCGACTACG CTGGCGGAGC TGGAAGTGGA TATCAATAAT
CAGGCGGAGA TCGACAAAGT GATTGCCCAC ACCTTACGTC CGGTGGAATC CATTCATTAC
CTGCCGGTCA CGCTGACACC AGATACGTTG CGTGCAGCGT TCGAAAAAGT GGAATCGTTT
AAAGCCTGA
 
Protein sequence
MPHNPIRVVV GPANYFSHPG SFNHLHDFFT DEQLSRAAWI YGERAIAAAQ TKLPPAFELP 
GAKHILFRGH CSESDVQQLA AESGDDRSVV IGVGGGALLD TAKALARRLG LPFVAVPTIA
ATCAAWTPLS VWYNDAGQAL HYEIFDDANF MVLVEPEIIL NAPQEYLLAG IGDTLAKWYE
AVVLAPQPET LPLTVRLGIN NAQAIRDVLL NSSEQALADQ QNHQLTQSFC DVVDAIIAGG
GMVGGLGDRF TRVAAAHAVH NGLTVLPQTE KFLHGTKVAY GILVQSALLG QDDVLAQLTG
AYQRFHLPTT LAELEVDINN QAEIDKVIAH TLRPVESIHY LPVTLTPDTL RAAFEKVESF
KA