Gene BMASAVP1_A1197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A1197 
SymbolthiC 
ID4679878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp1182093 
End bp1184024 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content66% 
IMG OID639845470 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_992532 
Protein GI121599133 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0705849 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCA ACCCCAAGTT CCTGTCGGCC GACGCCCGCG TCGACGCCGC CGCCGTCGCC 
CCGCTGCCGA ATTCGCGCAA GGTCTACGTG ACGGGCTCGC AACCCGACAT CCGCGTGCCG
ATGCGTGAGA TCACGCAGGC CGATACGCCG ACGAGCTTCG GCGGCGAAAA GAACCCGCCG
ATCTACGTCT ACGACACGTC GGGCCCGTAC ACGGACCCGG ACGCGAAGAT CGACATTCGC
GCGGGCCTGC CCGCGCTGCG CCAGCGCTGG ATCGACGCGC GCGGCGACAC CGAGACGCTC
TCGGGCCTCA CGAGCGACTA CGGCCGCGAG CGCGCGGCCG ATCCGGCGAC GGCCGAGCTG
CGCTTTCCCG GCCTGCACCG TCATCCGCGC CGCGCGAAGG CGGGCAAGAA CGTCACGCAG
ATGCACTACG CGCGCCAGGG CATCATCACA CCGGAAATGG AATACATCGC GATCCGCGAG
AACCAGCGCC GCGCCGAGTA TCTGGAAAGC CTGAAGGCAA GCGGCCCGAA CGGCGCGAAG
CTCGCCGCGA TGATGGGCCG CCAGCACGCG GGCCAGGCAT TCGGCGCCGC CGCGTTCGGC
GCGAACGCGC CGGCCGAGAT CACGCCGGAG TTCGTGCGCG ACGAAGTCGC GCGCGGCCGC
GCGATCATCC CGGCGAACAT CAACCACCCG GAAACCGAGC CGATGATCAT CGGCCGCAAC
TTCCTCGTGA AGATCAACGC GAACATCGGC AATTCGGCCG TCACGTCGTC GATCGGCGAG
GAAGTCGACA AGATGACGTG GGCGATCCGC TGGGGCGGCG ATACGGTGAT GGATCTGTCG
ACCGGCAAGC ACATCCATGA AACGCGCGAG TGGATCATCC GCAACAGCCC GGTGCCGATC
GGCACGGTGC CGATCTACCA GGCGCTGGAG AAGGTCAACG GCAAGGCCGA GGACCTGACC
TGGGAAATCT TCCGCGACAC GCTGATCGAG CAAGCCGAGC AAGGCGTCGA CTACTTCACG
ATCCACGCGG GCGTGCGCCT GCAATACGTG CCGCTCACCG CGAACCGGAT GACGGGCATC
GTGTCGCGCG GCGGCTCGAT CATGGCGAAG TGGTGCCTCG CGCACCACAA GGAAAGCTTC
CTGTACGAAC ACTTCGAAGA GATCTGCGAG ATCATGAAGG CGTACGACGT GAGCTTCTCG
CTCGGCGACG GCCTGCGCCC CGGCTCGATC TACGACGCGA ACGACGAAGC GCAGCTCGGC
GAACTGAAGA CGCTCGGCGA GCTCACGCAG ATCGCGTGGA AGCACGACGT GCAGGTGATG
ATCGAAGGCC CCGGCCACGT GCCGATGCAG TTGATCAAGG AGAACATGGA TCTTCAGCTC
GACTGGTGCA AGGAAGCGCC GTTCTACACG CTCGGGCCGC TGACCACCGA CATCGCGCCG
GGCTACGACC ACATCACGTC GGGCATCGGC GCCGCGATGA TCGGCTGGTT CGGCACCGCG
ATGCTGTGCT ACGTGACGCC GAAGGAGCAC CTCGGGCTGC CGAACAAGGA CGACGTGAAG
GAAGGCATCA TCACGTACAA GCTCGCCGCG CACGCGGCCG ACCTGGCGAA GGGCCACCCG
GGCGCGCAGG TGCGCGACAA CGCGCTGTCG AAGGCGCGCT TCGAGTTCCG CTGGCAAGAC
CAGTTCAACC TGGGGCTCGA CCCGGACAAG GCGCGAGAAT TCCACGACGA AACGCTGCCG
AAGGATTCGG CGAAGGTCGC GCATTTCTGC TCGATGTGCG GCCCGCACTT CTGCTCGATG
AAGATCACGC AGGACGTGCG CGAGTTCGCC GCTCAGCAGG GCGTGTCAGA AAACGACGCG
CTGAAGAAGG GGATGGAAGT GAAGGCGGTC GAGTTCGTGA AGAGCGGCTC GGAGATCTAT
CACCGCCAGT GA
 
Protein sequence
MNANPKFLSA DARVDAAAVA PLPNSRKVYV TGSQPDIRVP MREITQADTP TSFGGEKNPP 
IYVYDTSGPY TDPDAKIDIR AGLPALRQRW IDARGDTETL SGLTSDYGRE RAADPATAEL
RFPGLHRHPR RAKAGKNVTQ MHYARQGIIT PEMEYIAIRE NQRRAEYLES LKASGPNGAK
LAAMMGRQHA GQAFGAAAFG ANAPAEITPE FVRDEVARGR AIIPANINHP ETEPMIIGRN
FLVKINANIG NSAVTSSIGE EVDKMTWAIR WGGDTVMDLS TGKHIHETRE WIIRNSPVPI
GTVPIYQALE KVNGKAEDLT WEIFRDTLIE QAEQGVDYFT IHAGVRLQYV PLTANRMTGI
VSRGGSIMAK WCLAHHKESF LYEHFEEICE IMKAYDVSFS LGDGLRPGSI YDANDEAQLG
ELKTLGELTQ IAWKHDVQVM IEGPGHVPMQ LIKENMDLQL DWCKEAPFYT LGPLTTDIAP
GYDHITSGIG AAMIGWFGTA MLCYVTPKEH LGLPNKDDVK EGIITYKLAA HAADLAKGHP
GAQVRDNALS KARFEFRWQD QFNLGLDPDK AREFHDETLP KDSAKVAHFC SMCGPHFCSM
KITQDVREFA AQQGVSENDA LKKGMEVKAV EFVKSGSEIY HRQ