Gene EcSMS35_1620 details

Gene Information

Locus tag: EcSMS35_1620
Symbol: rspB
ID: 6143563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1609913
End bp: 1610932
Gene Length: 1020 bp
Protein Length: 339 aa
Translation table: 11
GC content: 45%
IMG OID: 641616496
Product: putative dehydrogenase
Protein accession: YP_001743674
Protein GI: 170681959
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID: [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase
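
The coordinates and lengths listed above are mutually consistent, which a short check can show. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1609913
end_bp = 1610932

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1020
print(protein_length)  # 339
```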


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00863317
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAGCA TTTTAATTGA AAAACCGAAT CAACTGGCAA TTATCGAACG CGAAATACCC 
ACCCCGTCAG CGGGTGAAGT ACGAGTAAAA GTGAAACTTG CCGGAATTTG TGGTTCAGAT
AGCCATATTT ATCGTGGGCA TAATCCTTTT GCGAAATATC CGCGCGTCAT TGGACATGAA
TTCTTTGGCG TCATTGATGC AGTGGGTGAA GGCGTGGAAA GCGACAGAGT CGGTGAACGC
GTTGCTGTCG ATCCGGTGGT CAGCTGTGGG CATTGCTATC CGTGCTCTAT AGGTAAGCCG
AACGTTTGTA CGACACTGGC TGTATTAGGT GTGCACGCTG ACGGTGGTTT CAGTGAATAT
GCCGTGGTGC CGGCAAAAAA TGCGTGGAAA ATTCCTGAAG CAGTGGCCGA TCAATATGCG
GTGATGATTG AACCTTTTAC CATTGCGGCT AACGTTACCG GTCATGGTCA ACCGACTGAA
AATGATACCG TTCTGGTTTA CGGTGCAGGT CCAATCGGCC TGACGATCGT TCAGGTATTA
AAAGGCGTCT ATAACGTTAA GAATGTGATT GTTGCCGATC GCATTGATGA ACGACTGGAA
AAAGCGAAAG AGAGCGGGGC AGACTGGGCG ATTAATAACA GCCAGACACC GCTTAGCGAG
AGTTTCGCTG AAAAAGGCAT CAAGCCGACA TTAATTATCG ATGCGGCTTG TCATCCTTCA
ATCCTGAAAG AAGCCGTAAC GCTGGCTTCT CCAGCGGCAC GTATTGTATT GATGGGCTTC
TCCAGTGAAC CGTCTGAAGT GATTCAGCAA GGAATTACCG GAAAAGAACT CTCTATTTTC
TCTTCACGCT TAAATGCAAA TAAATTCCCG GTCGTTATCG ACTGGTTAAG TAAAGGGTTA
ATTAAACCAG AAAAACTAAT TACCCATACG TTTGATTTCC AGCATGTTGC TGATGCCATT
AGTTTATTTG AACAGGATCA AAAGCATTGC TGCAAAGTCT TACTCACTTT TTCTGAATAA
 
Protein sequence
MKSILIEKPN QLAIIEREIP TPSAGEVRVK VKLAGICGSD SHIYRGHNPF AKYPRVIGHE 
FFGVIDAVGE GVESDRVGER VAVDPVVSCG HCYPCSIGKP NVCTTLAVLG VHADGGFSEY
AVVPAKNAWK IPEAVADQYA VMIEPFTIAA NVTGHGQPTE NDTVLVYGAG PIGLTIVQVL
KGVYNVKNVI VADRIDERLE KAKESGADWA INNSQTPLSE SFAEKGIKPT LIIDAACHPS
ILKEAVTLAS PAARIVLMGF SSEPSEVIQQ GITGKELSIF SSRLNANKFP VVIDWLSKGL
IKPEKLITHT FDFQHVADAI SLFEQDQKHC CKVLLTFSE
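
The relationship between the gene and protein sequences above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the tool used to produce the record: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and the codon table is the standard genetic code, which translation table 11 shares for all sense codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). Only the first line of the gene sequence is used to keep the example short.

```python
# Build the standard codon table (same sense-codon assignments as table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spacing and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGAAAAGCA TTTTAATTGA AAAACCGAAT CAACTGGCAA TTATCGAACG CGAAATACCC"
print(translate(first_line))  # MKSILIEKPNQLAIIEREIP
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the listed GC content.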