Gene BURPS1106A_1401 details


Gene Information

Locus tag: BURPS1106A_1401
Symbol: thiC
ID: 4900194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1106a
Kingdom: Bacteria
Replicon accession: NC_009076
Strand:
Start bp: 1376213
End bp: 1378144
Gene Length: 1932 bp
Protein Length: 643 aa
Translation table: 11
GC content: 66%
IMG OID: 640134631
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_001065674
Protein GI: 126452384
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
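
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1376213
END_BP = 1378144
GENE_LENGTH_BP = 1932
PROTEIN_LENGTH_AA = 643


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)      # True
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Under these assumptions the record is internally consistent: 1378144 - 1376213 + 1 = 1932 bp, and 1932 / 3 - 1 = 643 aa.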


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCCA ACCCCAAGTT CCTGTCGGCC GACGCCCGCG TCGACGCCGC CGCCGTCGCC 
CCGCTGCCGA ATTCGCGCAA GGTCTACGTG ACGGGCTCGC AACCCGACAT CCGCGTGCCG
ATGCGTGAGA TCACGCAGGC CGATACGCCG ACGAGCTTCG GCGGCGAAAA GAACCCGCCG
ATCTACGTCT ACGACACGTC GGGCCCGTAC ACGGACCCGG ACGCGAAGAT CGACATTCGC
GCGGGCCTGC CCGCGCTGCG CCAGCGCTGG ATCGACGCGC GCGGCGACAC CGAGACGCTC
GCGGGCCTCA CGAGCGACTA CGGCCGCGAG CGCGCGGCCG ATCCGGCGAC GGCCGAGCTG
CGCTTTCCCG GCCTGCACCG TCATCCGCGC CGCGCGAAGG CGGGCAAGAA CGTCACGCAG
ATGCACTACG CGCGCCAGGG CATCATCACG CCGGAAATGG AATACATCGC GATCCGCGAG
AACCAGCGCC GCGCCGAGTA TCTGGAAAGC CTGAAGGCAA GCGGCCCGAA CGGCGCGAAG
CTCGCCGCGA TGATGGGCCG CCAGCACGCG GGCCAGGCAT TCGGCGCCGC CGCGTTCGGC
GCGAACGCGC CGGCCGAGAT CACGCCGGAG TTCGTGCGCG ACGAAGTCGC GCGCGGCCGC
GCGATCATCC CGGCGAACAT CAACCACCCG GAAACCGAGC CGATGATCAT CGGCCGCAAC
TTCCTCGTGA AGATCAACGC GAACATCGGC AATTCGGCCG TCACGTCGTC GATCGGCGAG
GAAGTCGACA AGATGACGTG GGCGATCCGC TGGGGCGGCG ATACGGTGAT GGATCTGTCG
ACCGGCAAGC ACATCCATGA AACGCGCGAG TGGATCATCC GCAACAGCCC GGTGCCGATC
GGCACGGTGC CGATCTACCA GGCGCTGGAG AAGGTCAACG GCAAGGCCGA GGACCTGACC
TGGGAAATCT TCCGCGACAC GCTGATCGAG CAGGCCGAGC AAGGCGTCGA CTACTTCACG
ATCCACGCGG GCGTGCGCCT GCAATACGTG CCGCTCACCG CGAACCGGAT GACGGGCATC
GTGTCGCGCG GCGGCTCGAT CATGGCGAAG TGGTGCCTCG CGCACCACAA GGAAAGCTTC
CTGTACGAAC ACTTCGAAGA GATCTGCGAG ATCATGAAGG CGTACGACGT GAGCTTCTCG
CTCGGCGACG GCCTGCGCCC CGGCTCGATC TACGACGCGA ACGACGAAGC GCAGCTCGGC
GAACTGAAGA CGCTCGGCGA GCTCACGCAG ATCGCGTGGA AGCACGACGT GCAGGTGATG
ATCGAAGGCC CCGGCCACGT GCCGATGCAG TTGATCAAGG AGAACATGGA TCTTCAGCTC
GACTGGTGCA AGGAAGCGCC GTTCTACACG CTCGGGCCGC TGACCACCGA CATCGCGCCG
GGCTACGACC ACATCACGTC GGGCATCGGC GCCGCGATGA TCGGCTGGTT CGGCACCGCG
ATGCTGTGCT ACGTGACGCC GAAGGAGCAC CTCGGGCTGC CGAACAAGGA CGACGTGAAG
GAAGGCATCA TCACGTACAA GCTCGCCGCG CACGCGGCCG ACCTGGCGAA GGGCCACCCG
GGCGCGCAGG TGCGCGACAA CGCGCTGTCG AAGGCGCGCT TCGAGTTCCG CTGGCAAGAC
CAGTTCAACC TGGGGCTCGA CCCGGACAAG GCGCGAGAAT TCCACGACGA AACGCTGCCG
AAGGATTCGG CGAAGGTCGC GCATTTCTGC TCGATGTGCG GCCCGCACTT CTGCTCGATG
AAGATCACGC AGGACGTGCG CGAGTTCGCC GCTCAGCAGG GCGTGTCGGA AAACGACGCG
CTGAAGAAGG GGATGGAAGT GAAGGCGGTC GAGTTCGTGA AGAGCGGCTC GGAGATCTAT
CACCGCCAGT GA
 
Protein sequence
MNANPKFLSA DARVDAAAVA PLPNSRKVYV TGSQPDIRVP MREITQADTP TSFGGEKNPP 
IYVYDTSGPY TDPDAKIDIR AGLPALRQRW IDARGDTETL AGLTSDYGRE RAADPATAEL
RFPGLHRHPR RAKAGKNVTQ MHYARQGIIT PEMEYIAIRE NQRRAEYLES LKASGPNGAK
LAAMMGRQHA GQAFGAAAFG ANAPAEITPE FVRDEVARGR AIIPANINHP ETEPMIIGRN
FLVKINANIG NSAVTSSIGE EVDKMTWAIR WGGDTVMDLS TGKHIHETRE WIIRNSPVPI
GTVPIYQALE KVNGKAEDLT WEIFRDTLIE QAEQGVDYFT IHAGVRLQYV PLTANRMTGI
VSRGGSIMAK WCLAHHKESF LYEHFEEICE IMKAYDVSFS LGDGLRPGSI YDANDEAQLG
ELKTLGELTQ IAWKHDVQVM IEGPGHVPMQ LIKENMDLQL DWCKEAPFYT LGPLTTDIAP
GYDHITSGIG AAMIGWFGTA MLCYVTPKEH LGLPNKDDVK EGIITYKLAA HAADLAKGHP
GAQVRDNALS KARFEFRWQD QFNLGLDPDK AREFHDETLP KDSAKVAHFC SMCGPHFCSM
KITQDVREFA AQQGVSENDA LKKGMEVKAV EFVKSGSEIY HRQ
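
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons with translation table 11. The following is a minimal sketch, not the database's own pipeline. It assumes the amino-acid assignments of table 11 match the standard genetic code (the two tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The helper names `clean` and `translate` are illustrative.

```python
# Codon table built from the compact NCBI layout: bases ordered T, C, A, G,
# with one amino-acid letter per codon ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display sequence blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGAACGCCA ACCCCAAGTT CCTGTCGGCC GACGCCCGCG TCGACGCCGC CGCCGTCGCC"
last_line = "CACCGCCAGT GA"

print(translate(first_line))  # MNANPKFLSADARVDAAAVA
print(translate(last_line))   # HRQ
```

The first 60 bases give the first 20 residues of the protein sequence, and the final 12 bases give the last three residues followed by the TGA stop codon. Passing the full gene sequence to `translate` should reproduce the protein sequence in the same way.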