Gene BURPS668_1395 details


Gene Information

Locus tag: BURPS668_1395
Symbol: thiC
ID: 4883642
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1366016
End bp: 1367947
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 66%
IMG OID: 640127323
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001058438
Protein GI: 126439171
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
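
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1366016
end_bp = 1367947

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1932 bp) and Protein Length (643 aa) fields.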


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCA ACCCCAAGTT CCTGTCGGCC GACGCCCGCG TCGACGCCGC CGCCGTCGCC 
CCGCTGCCGA ATTCGCGCAA GGTCTACGTG ACGGGCTCGC AACCCGACAT CCGCGTGCCG
ATGCGTGAGA TCACGCAGGC CGATACGCCG ACGAGCTTCG GCGGCGAAAA GAACCCGCCG
ATCTACGTCT ACGACACGTC GGGCCCGTAC ACGGACCCGG ACGCGAAGAT CGACATTCGC
GCGGGCCTGC CCGCGCTGCG CCAGCGCTGG ATCGACGCGC GCGGCGACAC CGAGACGCTC
GCGGGCCTCA CGAGCGACTA CGGCCGCGAG CGCGCGGCCG ATCCGGCGAC GGCCGAGCTG
CGCTTTCCCG GCCTGCACCG CCATCCGCGC CGCGCGAAGG CGGGCAAGAA CGTCACGCAG
ATGCACTACG CGCGCCAGGG CATCATCACA CCGGAAATGG AATACATCGC GATCCGCGAG
AACCAGCGAC GCGCCGAGTA TCTGGAAAGC CTGAAGGCAA GCGGCCCGAA CGGCGCGAAG
CTCGCCGCGA TGATGGGCCG CCAGCACGCG GGCCAGGCAT TCGGCGCCGC CGCGTTCGGC
GCGAACGCGC CGGCCGAGAT CACGCCGGAG TTCGTGCGCG ACGAAGTCGC GCGCGGCCGC
GCGATCATCC CGGCGAACAT CAACCACCCG GAAACCGAGC CGATGATCAT CGGCCGCAAC
TTCCTCGTGA AGATCAACGC GAACATCGGC AATTCGGCCG TCACGTCGTC GATCGGCGAG
GAAGTCGACA AGATGACGTG GGCGATCCGC TGGGGCGGCG ATACGGTGAT GGATCTGTCG
ACCGGCAAGC ACATCCATGA AACGCGCGAG TGGATCATCC GCAACAGCCC GGTGCCGATC
GGCACGGTGC CGATCTACCA GGCGCTGGAG AAGGTCAACG GCAAGGCCGA GGACCTGACC
TGGGAAATCT TCCGCGACAC GCTGATCGAG CAGGCCGAGC AAGGCGTCGA CTACTTCACG
ATCCACGCGG GCGTGCGCCT GCAATACGTG CCGCTCACCG CGAACCGGAT GACGGGCATC
GTGTCGCGCG GCGGCTCGAT TATGGCGAAG TGGTGCCTCG CGCACCACAA GGAAAGCTTC
CTGTACGAAC ACTTCGAAGA GATCTGCGAG ATCATGAAGG CGTACGACGT GAGCTTCTCG
CTCGGCGACG GCCTGCGCCC CGGCTCGATC TACGACGCGA ACGACGAAGC GCAGCTCGGC
GAACTGAAGA CGCTCGGTGA GCTCACGCAG ATCGCATGGA AGCACGACGT GCAGGTGATG
ATCGAAGGCC CCGGCCACGT GCCGATGCAG TTGATCAAGG AGAACATGGA TCTTCAGCTC
GACTGGTGCA AGGAAGCGCC GTTCTACACA CTCGGGCCGC TGACCACCGA CATCGCGCCG
GGCTACGACC ACATCACGTC GGGCATCGGC GCCGCGATGA TCGGCTGGTT CGGCACCGCG
ATGCTGTGCT ACGTGACGCC GAAGGAGCAC CTCGGGCTGC CGAACAAGGA CGACGTGAAG
GAAGGCATCA TCACGTACAA GCTCGCCGCG CACGCGGCCG ACCTGGCGAA GGGCCACCCG
GGCGCGCAGG TACGCGACAA CGCGCTGTCG AAGGCGCGCT TCGAGTTCCG CTGGCAAGAC
CAGTTCAACC TGGGGCTCGA CCCGGACAAG GCGCGAGAAT TCCACGACGA AACGCTGCCG
AAGGATTCGG CGAAGGTCGC GCATTTCTGC TCGATGTGCG GCCCGCACTT CTGCTCGATG
AAGATCACGC AGGACGTGCG CGAGTTCGCC GCTCAGCAGG GCGTGTCGGA AAACGACGCG
CTGAAGAAGG GGATGGAAGT GAAGGCGGTC GAGTTCGTGA AGAGCGGCTC GGAGATCTAT
CACCGCCAGT GA
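
The GC content field of a record like this is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows. The function name `gc_content` is hypothetical, and the example runs only on the first row of the sequence above, so its result is not expected to equal the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base grouping used in this record) is ignored.
    """
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc = sum(1 for base in cleaned if base in "GC")
    return gc / len(cleaned)


# First row of the gene sequence, copied from this record.
first_row = "ATGAACGCCA ACCCCAAGTT CCTGTCGGCC GACGCCCGCG TCGACGCCGC CGCCGTCGCC"
print(f"{gc_content(first_row):.0%}")
```

Passing the full 1932 bp sequence instead of one row would give the value to compare against the GC content field.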
 
Protein sequence
MNANPKFLSA DARVDAAAVA PLPNSRKVYV TGSQPDIRVP MREITQADTP TSFGGEKNPP 
IYVYDTSGPY TDPDAKIDIR AGLPALRQRW IDARGDTETL AGLTSDYGRE RAADPATAEL
RFPGLHRHPR RAKAGKNVTQ MHYARQGIIT PEMEYIAIRE NQRRAEYLES LKASGPNGAK
LAAMMGRQHA GQAFGAAAFG ANAPAEITPE FVRDEVARGR AIIPANINHP ETEPMIIGRN
FLVKINANIG NSAVTSSIGE EVDKMTWAIR WGGDTVMDLS TGKHIHETRE WIIRNSPVPI
GTVPIYQALE KVNGKAEDLT WEIFRDTLIE QAEQGVDYFT IHAGVRLQYV PLTANRMTGI
VSRGGSIMAK WCLAHHKESF LYEHFEEICE IMKAYDVSFS LGDGLRPGSI YDANDEAQLG
ELKTLGELTQ IAWKHDVQVM IEGPGHVPMQ LIKENMDLQL DWCKEAPFYT LGPLTTDIAP
GYDHITSGIG AAMIGWFGTA MLCYVTPKEH LGLPNKDDVK EGIITYKLAA HAADLAKGHP
GAQVRDNALS KARFEFRWQD QFNLGLDPDK AREFHDETLP KDSAKVAHFC SMCGPHFCSM
KITQDVREFA AQQGVSENDA LKKGMEVKAV EFVKSGSEIY HRQ
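
The protein sequence can be related back to the gene sequence by codon-by-codon translation. The sketch below is an illustration under stated assumptions, not the method used to produce this record: it builds the codon table from the standard NCBI base ordering (translation table 11 assigns the same amino acids to codons as the standard table and differs only in permitted start codons), and the helper name `translate` is hypothetical. Only the first 30 and last 12 bases of the gene are used.

```python
from itertools import product

# Codon table in the conventional NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence in frame 1; '*' marks a stop codon."""
    cleaned = "".join(seq.split()).upper()
    usable = len(cleaned) - len(cleaned) % 3  # drop any trailing partial codon
    return "".join(
        CODON_TABLE[cleaned[i:i + 3]] for i in range(0, usable, 3)
    )


# First 30 and last 12 bases of the gene sequence in this record.
gene_start = "ATGAACGCCA ACCCCAAGTT CCTGTCGGCC"
gene_end = "CACCGCCAGT GA"

print(translate(gene_start))
print(translate(gene_end))
```

The first call should reproduce the opening residues of the protein sequence, and the second should end in the stop symbol that follows the final residues.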