Gene EcSMS35_3303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3303 
Symbol 
ID6142919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3379032 
End bp3380048 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content47% 
IMG OID641618133 
Productzinc-binding dehydrogenase family protein 
Protein accessionYP_001745283 
Protein GI170683214 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0731443 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAT TAATTTGTCA GCAGCCTGGC GTTATGGAAT ATGTGGAAAA GGATATTCCC 
ACACCAGCAG ATAATGAAGT GCTGTTAAAA ATCAAAGCTG TGGGTATTTG TGGTACTGAT
ATTCACGCTT TTGCCGGCAG ACAGCCTTTT TTTAGCTACC CACGTGTATT AGGTCATGAA
ATATGCGCCG AAGCGGTTTC GCGAGGCAGC CAGTGCCAAA CAGCACAATC AGGCCAGCGC
TATTCCGTCA TCCCATGCAT TCCGTGTGGC GAGTGCGCAG CCTGTCGGGA AGAGAAAACG
AACTGCTGCG AACGTGTTTC GCTGTATGGC GTGCATCAGG ATGGGGGTTT TAGTGAGTAC
CTTGCGGTAC GTGAAGACAA CCTTGTGCCT CTCCCTGACG AGGTCAGCGA CAGTGCCGGA
GCATTGGTTG AATGTTTCGC CATTGGTGCA CATGCCGTTC GTCGGGCAGA GATCAAGGCT
GAACAAAACG TACTGGTGAT TGGTGCTGGG CCAATCGGTT TGGCTACCGC AGCCATCGCC
AGGGCTAAAG GGGCGCATGT TGTTGTTGCT GATATTGACT GTCAACGTCG CCAGCACGTT
GTGGATCATC TGGCAATTAA TGTCTTCGAC CCAACACAGG AAGGTTTTAT TGCCGCGCTT
AGTGAAGTAT TTGGAGGCGA ACTGGCTTGC GTAGTACTGG ATGCGACGGG AAATAAAGCT
TCAATGAGTC ATGATGTAAA TCTTATTCGT CATGGCGGCA AAATTGTTTT CATCGGTTTG
TACATTGGTG AACTTGTTAT TGACGATCCG ACCTTCCATA AAAAAGAGAC AACGTTACTC
AGCAGCCGCA ATGCCACACG GGAAGATTTT GCGTTGGTGA TTGAACTGAT GCGCAGCAAT
AAAATTCACG AAAATTTAAT GAAAAACCAG GCGTTCAATT TCTTTAGTGT TGGCGAAGAT
TACCAGCGTA ACGTTGTAGA AAATAAAAAT ATGGTCAAGG GTGTGATCAC TTTTTAA
 
Protein sequence
MKTLICQQPG VMEYVEKDIP TPADNEVLLK IKAVGICGTD IHAFAGRQPF FSYPRVLGHE 
ICAEAVSRGS QCQTAQSGQR YSVIPCIPCG ECAACREEKT NCCERVSLYG VHQDGGFSEY
LAVREDNLVP LPDEVSDSAG ALVECFAIGA HAVRRAEIKA EQNVLVIGAG PIGLATAAIA
RAKGAHVVVA DIDCQRRQHV VDHLAINVFD PTQEGFIAAL SEVFGGELAC VVLDATGNKA
SMSHDVNLIR HGGKIVFIGL YIGELVIDDP TFHKKETTLL SSRNATREDF ALVIELMRSN
KIHENLMKNQ AFNFFSVGED YQRNVVENKN MVKGVITF