Gene Information |
Locus tag | EcSMS35_0002 |
Symbol | thrB |
ID | 6143317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2800 |
End bp | 3732 |
Gene Length | 933 bp |
Protein Length | 310 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641614903 |
Product | homoserine kinase |
Protein accession | YP_001742119 |
Protein GI | 170681884 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0083] Homoserine kinase |
TIGRFAM ID | [TIGR00191] homoserine kinase |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.475933 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTAAAG TTTATGCCCC GGCTTCCAGT GCCAATATGA GCGTCGGGTT TGATGTGCTC GGGGCGGCGG TGACACCTGT TGATGGTGCA TTGCTCGGAG ATGTAGTCAC GGTTGAGGCG GCAGAGACAT TCAGTCTCAA CAACCTCGGA CGCTTTGCCG ATAAGCTGCC GTCAGAACCA CGGGAAAATA TCGTTTATCA GTGCTGGGAG CGTTTTTGCC AGGAACTGGG TAAGCAAATT CCAGTGGCGA TGACCCTGGA AAAGAATATG CCGATCGGTT CGGGCTTAGG CTCCAGTGCC TGTTCGGTGG TCGCGGCGCT GATGGCGATG AATGAACACT GCGGCAAGCC GCTTAATGAC ACTCGTTTGC TGGCTTTGAT GGGCGAGCTG GAAGGCCGTA TCTCCGGCAG CATTCATTAC GACAACGTGG CACCGTGTTT TCTTGGTGGT ATGCAGTTGA TGATCGAAGA AAACGACATC ATCAGTCAGC AAGTGCCAGG GTTTGATGAG TGGCTGTGGG TGCTGGCGTA TCCGGGGATT AAAGTCTCGA CGGCAGAAGC CCGGGCTATT TTACCGGCGC AGTATCGCCG CCAGGATTGC ATTGCGCACG GGCGACATCT GGCAGGCTTC ATTCACGCCT GCTATTCCCG TCAGCCTGAG CTTGCCGCGA AGCTGATGAA AGATGTTATC GCTGAACCCT ACCGTGAACG GTTACTGCCA GGCTTCCGGC AGGCGCGGCA GGCGGTCGCG GAAATCGGCG CGGTAGCGAG CGGTATCTCC GGCTCCGGCC CGACCTTGTT CGCTCTGTGT GACAAGCCGG AAACCGCCCA GCGCGTTGCC GACTGGTTGG GTAAGAACTA CCTGCAAAAT CAGGAAGGTT TTGTTCATAT TTGCCAGCTG GATACGGCGG GCGCACGAGT ACTGGAAAAC TAA |
Protein sequence | MVKVYAPASS ANMSVGFDVL GAAVTPVDGA LLGDVVTVEA AETFSLNNLG RFADKLPSEP RENIVYQCWE RFCQELGKQI PVAMTLEKNM PIGSGLGSSA CSVVAALMAM NEHCGKPLND TRLLALMGEL EGRISGSIHY DNVAPCFLGG MQLMIEENDI ISQQVPGFDE WLWVLAYPGI KVSTAEARAI LPAQYRRQDC IAHGRHLAGF IHACYSRQPE LAAKLMKDVI AEPYRERLLP GFRQARQAVA EIGAVASGIS GSGPTLFALC DKPETAQRVA DWLGKNYLQN QEGFVHICQL DTAGARVLEN |
| |
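The derived fields in this record (Gene Length, Protein Length, GC content, Protein sequence) follow from the coordinates and the gene sequence, so they can be cross-checked with a few lines of code. The following is a minimal sketch, not the database's own method. It assumes that coordinates are 1-based and inclusive, and that the sequence is pasted with the space-separated 10-base groups shown above. The function names (`gene_length`, `gc_percent`, `translate`) are hypothetical helpers introduced here for illustration. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code; it differs only in the set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base groups and uppercase."""
    return "".join(seq.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; drop trailing stop codons.

    Assumption: the first codon is ATG. Alternative table 11 start codons
    (e.g. GTG, TTG) would need to be mapped to M separately.
    """
    s = clean(seq)
    codons = (s[i:i + 3] for i in range(0, len(s) - 2, 3))
    return "".join(CODON_TABLE[c] for c in codons).rstrip("*")


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence in this record.
    prefix = "ATGGTTAAAG TTTATGCCCC GGCTTCCAGT"
    print(gene_length(2800, 3732))
    print(translate(prefix))
```

With the values in this record, the coordinates give 3732 - 2800 + 1 = 933 bp, and 933 / 3 = 311 codons, one of which is the terminal TAA stop, leaving the 310 aa listed as Protein Length. Passing the full Gene sequence field to `translate` and `gc_percent` allows the same comparison against the Protein sequence and GC content fields.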