Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BBta_4643 |
Symbol | thiL |
ID | 5151821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 4869595 |
End bp | 4870587 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640559445 |
Product | thiamin-monophosphate kinase |
Protein accession | YP_001240577 |
Protein GI | 148255992 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0611] Thiamine monophosphate kinase |
TIGRFAM ID | [TIGR01379] thiamine-monophosphate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.567008 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.372067 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACC AGCCCTCCGG CGAAGACTCC CTGATCGCGC GCTACTTCAA GCCGCTGGCG ACCGACCCTG GCGCCTTCGG CCTGGTCGAT GACGCCGCCA TCATTCCTGC CGATGGTGAC GATCTGGTCG TCAACACCGA CGCCATCGTC GAAGGCGTCC ATTATCTGCC CGATGATCCG CCCGACACCA TCGCGCGCAA GGCGTTGCGG GTGAACCTGT CCGATCTTGC CGCCAAAGGC GCCGTCCCGG CCGGCTTTGT TCTAACCCTA GCACTGCGAC AGAAGGACGA AGCCTGGCTC AGTGCCTTCG CGCGCGGGCT CGGCGAGGAC GCCGCAGCCT TCGGCTGCCC GCTTCTGGGC GGCGACACGG TGTCGACGCC TGGTCCGGTG ATGATCTCGA TCACGGCTTG GGGCCGGGTG CCCAAGGGGC GGATGGTGCA CCGCTTCGGC GCCCGCCCCG GCGATCGGGT CTGGGTCACG GGAACGATCG GCGACGCGAT GCTCGGGCTT GCCGTGTCGA AGGGCGGGCC GGCGGCTGCC GCCCTGGCCG GCGATCCCGC CGCGCGGGAT ATGCTGATCG GCCGCTATCG CGTGCCGCAG CCGCGTCACT TGTTAGCTGT GCCGGTGAGG GAGTTTGCGA CCGCCTCGAT GGATGTCTCC GACGGCCTCG CGGGGGATCT TTCCAAGCTC TGCGCCGCGT CGCGCGTGAG TGCCGACATC GCTTTGTCGC AGGTGCCGAT CTCATCAGCA GCGGCAAAGC TTGTGACGGC GGGCTATCAC CAGCTTGAAG GCCTGATCTC CGGCGGCGAC GATTATGAGA TCGTCTGCAC TGTTCCCGCA GCGCGATGCG CCGCTTTTTG CGCTGCGGCC GGAGCGGCTG GCGTGGCTGT CACCGACATC GGGGTCATCG TCGAAGGACC CGACGTGCCG CGCTTCCTGG ATGAGCAGGG GCGTCCGGTC GTCTTGAAAC AGCGGTCCTA CAGCCACTTC TGA
|
Protein sequence | MTHQPSGEDS LIARYFKPLA TDPGAFGLVD DAAIIPADGD DLVVNTDAIV EGVHYLPDDP PDTIARKALR VNLSDLAAKG AVPAGFVLTL ALRQKDEAWL SAFARGLGED AAAFGCPLLG GDTVSTPGPV MISITAWGRV PKGRMVHRFG ARPGDRVWVT GTIGDAMLGL AVSKGGPAAA ALAGDPAARD MLIGRYRVPQ PRHLLAVPVR EFATASMDVS DGLAGDLSKL CAASRVSADI ALSQVPISSA AAKLVTAGYH QLEGLISGGD DYEIVCTVPA ARCAAFCAAA GAAGVAVTDI GVIVEGPDVP RFLDEQGRPV VLKQRSYSHF
|
| |