Gene BBta_4643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4643 
SymbolthiL 
ID5151821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4869595 
End bp4870587 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content67% 
IMG OID640559445 
Productthiamin-monophosphate kinase 
Protein accessionYP_001240577 
Protein GI148255992 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.567008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.372067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCACC AGCCCTCCGG CGAAGACTCC CTGATCGCGC GCTACTTCAA GCCGCTGGCG 
ACCGACCCTG GCGCCTTCGG CCTGGTCGAT GACGCCGCCA TCATTCCTGC CGATGGTGAC
GATCTGGTCG TCAACACCGA CGCCATCGTC GAAGGCGTCC ATTATCTGCC CGATGATCCG
CCCGACACCA TCGCGCGCAA GGCGTTGCGG GTGAACCTGT CCGATCTTGC CGCCAAAGGC
GCCGTCCCGG CCGGCTTTGT TCTAACCCTA GCACTGCGAC AGAAGGACGA AGCCTGGCTC
AGTGCCTTCG CGCGCGGGCT CGGCGAGGAC GCCGCAGCCT TCGGCTGCCC GCTTCTGGGC
GGCGACACGG TGTCGACGCC TGGTCCGGTG ATGATCTCGA TCACGGCTTG GGGCCGGGTG
CCCAAGGGGC GGATGGTGCA CCGCTTCGGC GCCCGCCCCG GCGATCGGGT CTGGGTCACG
GGAACGATCG GCGACGCGAT GCTCGGGCTT GCCGTGTCGA AGGGCGGGCC GGCGGCTGCC
GCCCTGGCCG GCGATCCCGC CGCGCGGGAT ATGCTGATCG GCCGCTATCG CGTGCCGCAG
CCGCGTCACT TGTTAGCTGT GCCGGTGAGG GAGTTTGCGA CCGCCTCGAT GGATGTCTCC
GACGGCCTCG CGGGGGATCT TTCCAAGCTC TGCGCCGCGT CGCGCGTGAG TGCCGACATC
GCTTTGTCGC AGGTGCCGAT CTCATCAGCA GCGGCAAAGC TTGTGACGGC GGGCTATCAC
CAGCTTGAAG GCCTGATCTC CGGCGGCGAC GATTATGAGA TCGTCTGCAC TGTTCCCGCA
GCGCGATGCG CCGCTTTTTG CGCTGCGGCC GGAGCGGCTG GCGTGGCTGT CACCGACATC
GGGGTCATCG TCGAAGGACC CGACGTGCCG CGCTTCCTGG ATGAGCAGGG GCGTCCGGTC
GTCTTGAAAC AGCGGTCCTA CAGCCACTTC TGA
 
Protein sequence
MTHQPSGEDS LIARYFKPLA TDPGAFGLVD DAAIIPADGD DLVVNTDAIV EGVHYLPDDP 
PDTIARKALR VNLSDLAAKG AVPAGFVLTL ALRQKDEAWL SAFARGLGED AAAFGCPLLG
GDTVSTPGPV MISITAWGRV PKGRMVHRFG ARPGDRVWVT GTIGDAMLGL AVSKGGPAAA
ALAGDPAARD MLIGRYRVPQ PRHLLAVPVR EFATASMDVS DGLAGDLSKL CAASRVSADI
ALSQVPISSA AAKLVTAGYH QLEGLISGGD DYEIVCTVPA ARCAAFCAAA GAAGVAVTDI
GVIVEGPDVP RFLDEQGRPV VLKQRSYSHF