Gene TBFG_10428 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTBFG_10428 
Symbol 
ID5221092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium tuberculosis F11 
KingdomBacteria 
Replicon accessionNC_009565 
Strand
Start bp511752 
End bp513395 
Gene Length1644 bp 
Protein Length547 aa 
Translation table11 
GC content65% 
IMG OID640605169 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001286373 
Protein GI148821619 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones237 
Plasmid unclonability p-value0.00121704 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones214 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATCA CCGTTGAACC GTCGGTGACC ACGGGTCCCA TCGCGGGCAG CGCCAAGGCC 
TACCGTGAAA TCGAGGCTCC CGGCAGCGGA GCTACTCTCC AAGTCCCGTT TCGACGGGTG
CACTTGTCCA CCGGAGACCA CTTCGACCTC TACGACACCT CCGGGCCCTA CACCGACACG
GACACGGTGA TCGACCTGAC CGCGGGGCTG CCGCATAGGC CCGGAGTGGT TCGCGATCGG
GGCACCCAGC TGCAGCGGGC CCGCGCCGGG GAGATCACCG CCGAGATGGC GTTCATCGCC
GCCCGCGAAG ACATGTCCGC CGAGCTAGTG CGCGACGAGG TCGCCCGCGG CCGCGCGGTG
ATCCCGGCCA ACCACCACCA CCCCGAGAGC GAGCCGATGA TCATCGGCAA GGCGTTCGCG
GTGAAAGTCA ACGCCAACAT CGGCAACTCG GCGGTGACGA GCTCGATCGC CGAGGAGGTC
GACAAGATGG TGTGGGCCAC CCGCTGGGGG GCCGACACCA TCATGGACCT GTCCACCGGC
AAGAACATCC ACGAAACCCG CGAGTGGATC CTGCGCAATT CTCCCGTGCC GGTCGGCACC
GTGCCGATCT ATCAGGCGCT GGAGAAAGTC AAGGGCGATC CGACCGAGCT GACCTGGGAG
ATCTACCGCG ACACCGTGAT CGAGCAGTGT GAGCAAGGCG TGGACTACAT GACGGTGCAC
GCCGGGGTGC TGCTGCGGTA TGTGCCGCTG ACCGCCAAGC GGGTCACCGG CATCGTGTCC
CGCGGGGGTT CGATCATGGC CGCGTGGTGT TTGGCACATC ATCGGGAGTC GTTCTTGTAC
ACCAACTTTG AGGAGCTCTG CGATATTTTC GCCCGCTACG ACGTCACCTT CTCACTCGGT
GACGGGCTGC GACCAGGGTC GATCGCTGAT GCCAACGACG CCGCGCAGTT CGCCGAGCTG
CGCACCCTGG GCGAGCTCAC CAAGATCGCC AAAGCCCATG GCGCACAGGT GATGATCGAG
GGGCCGGGCC ATATCCCAAT GCACAAGATC GTCGAGAATG TGCGGCTGGA AGAGGAACTG
TGTGAGGAGG CCCCGTTCTA CACGCTGGGT CCGCTGGCCA CCGACATCGC GCCGGCCTAC
GACCACATCA CCTCGGCGAT CGGTGCGGCC ATCATCGCCC AAGCCGGTAC CGCGATGCTG
TGCTACGTCA CCCCCAAGGA GCACCTCGGG TTGCCGGACC GCAAGGACGT CAAGGACGGG
GTGATCGCCT ACAAGATCGC CGCGCATGCG GCCGATTTGG CCAAGGGCCA TCCGCGCGCC
CAGGAGCGCG ACGACGCTTT GAGCACGGCG CGTTTCGAGT TCCGCTGGAA CGACCAGTTC
GCACTGTCGC TGGATCCCGA CACCGCACGG GAATTCCACG ACGAAACCCT GCCGGCGGAG
CCGGCCAAGA CCGCGCACTT CTGCTCGATG TGCGGACCGA AGTTCTGCTC CATGCGCATC
ACCCAGGACG TCCGTGAGTA CGCCGCCGAA CACGGGCTTG AGACCGAAGC GGACATCGAA
GCCGTGCTCG CCGCCGGAAT GGCCGAAAAG TCACGTGAAT TCGCAGAGCA CGGCAATCGG
GTGTATCTCC CGATAACCCA GTGA
 
Protein sequence
MTITVEPSVT TGPIAGSAKA YREIEAPGSG ATLQVPFRRV HLSTGDHFDL YDTSGPYTDT 
DTVIDLTAGL PHRPGVVRDR GTQLQRARAG EITAEMAFIA AREDMSAELV RDEVARGRAV
IPANHHHPES EPMIIGKAFA VKVNANIGNS AVTSSIAEEV DKMVWATRWG ADTIMDLSTG
KNIHETREWI LRNSPVPVGT VPIYQALEKV KGDPTELTWE IYRDTVIEQC EQGVDYMTVH
AGVLLRYVPL TAKRVTGIVS RGGSIMAAWC LAHHRESFLY TNFEELCDIF ARYDVTFSLG
DGLRPGSIAD ANDAAQFAEL RTLGELTKIA KAHGAQVMIE GPGHIPMHKI VENVRLEEEL
CEEAPFYTLG PLATDIAPAY DHITSAIGAA IIAQAGTAML CYVTPKEHLG LPDRKDVKDG
VIAYKIAAHA ADLAKGHPRA QERDDALSTA RFEFRWNDQF ALSLDPDTAR EFHDETLPAE
PAKTAHFCSM CGPKFCSMRI TQDVREYAAE HGLETEADIE AVLAAGMAEK SREFAEHGNR
VYLPITQ