Gene Information |
Locus tag | EcSMS35_1620 |
Symbol | rspB |
ID | 6143563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1609913 |
End bp | 1610932 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641616496 |
Product | putative dehydrogenase |
Protein accession | YP_001743674 |
Protein GI | 170681959 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00863317 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 76 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGCA TTTTAATTGA AAAACCGAAT CAACTGGCAA TTATCGAACG CGAAATACCC ACCCCGTCAG CGGGTGAAGT ACGAGTAAAA GTGAAACTTG CCGGAATTTG TGGTTCAGAT AGCCATATTT ATCGTGGGCA TAATCCTTTT GCGAAATATC CGCGCGTCAT TGGACATGAA TTCTTTGGCG TCATTGATGC AGTGGGTGAA GGCGTGGAAA GCGACAGAGT CGGTGAACGC GTTGCTGTCG ATCCGGTGGT CAGCTGTGGG CATTGCTATC CGTGCTCTAT AGGTAAGCCG AACGTTTGTA CGACACTGGC TGTATTAGGT GTGCACGCTG ACGGTGGTTT CAGTGAATAT GCCGTGGTGC CGGCAAAAAA TGCGTGGAAA ATTCCTGAAG CAGTGGCCGA TCAATATGCG GTGATGATTG AACCTTTTAC CATTGCGGCT AACGTTACCG GTCATGGTCA ACCGACTGAA AATGATACCG TTCTGGTTTA CGGTGCAGGT CCAATCGGCC TGACGATCGT TCAGGTATTA AAAGGCGTCT ATAACGTTAA GAATGTGATT GTTGCCGATC GCATTGATGA ACGACTGGAA AAAGCGAAAG AGAGCGGGGC AGACTGGGCG ATTAATAACA GCCAGACACC GCTTAGCGAG AGTTTCGCTG AAAAAGGCAT CAAGCCGACA TTAATTATCG ATGCGGCTTG TCATCCTTCA ATCCTGAAAG AAGCCGTAAC GCTGGCTTCT CCAGCGGCAC GTATTGTATT GATGGGCTTC TCCAGTGAAC CGTCTGAAGT GATTCAGCAA GGAATTACCG GAAAAGAACT CTCTATTTTC TCTTCACGCT TAAATGCAAA TAAATTCCCG GTCGTTATCG ACTGGTTAAG TAAAGGGTTA ATTAAACCAG AAAAACTAAT TACCCATACG TTTGATTTCC AGCATGTTGC TGATGCCATT AGTTTATTTG AACAGGATCA AAAGCATTGC TGCAAAGTCT TACTCACTTT TTCTGAATAA
|
Protein sequence | MKSILIEKPN QLAIIEREIP TPSAGEVRVK VKLAGICGSD SHIYRGHNPF AKYPRVIGHE FFGVIDAVGE GVESDRVGER VAVDPVVSCG HCYPCSIGKP NVCTTLAVLG VHADGGFSEY AVVPAKNAWK IPEAVADQYA VMIEPFTIAA NVTGHGQPTE NDTVLVYGAG PIGLTIVQVL KGVYNVKNVI VADRIDERLE KAKESGADWA INNSQTPLSE SFAEKGIKPT LIIDAACHPS ILKEAVTLAS PAARIVLMGF SSEPSEVIQQ GITGKELSIF SSRLNANKFP VVIDWLSKGL IKPEKLITHT FDFQHVADAI SLFEQDQKHC CKVLLTFSE
|
| |
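The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive (which is what the stated 1020 bp implies) and that the CDS is complete, unspliced, and ends in a stop codon that is not counted in the protein length. The function names are hypothetical helpers introduced for illustration only.

```python
# Values copied from the record above.
START_BP = 1609913
END_BP = 1610932


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete, unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def clean_sequence(raw: str) -> str:
    """Strip the spaces that group the displayed sequence into blocks of 10."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print(length_bp)                  # 1020
    print(protein_length(length_bp))  # 339
```

To check the GC content field, pass the full "Gene sequence" value to `gc_percent`; the record reports a rounded percentage, so compare after rounding.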