Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMASAVP1_A1197 |
Symbol | thiC |
ID | 4679878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei SAVP1 |
Kingdom | Bacteria |
Replicon accession | NC_008785 |
Strand | - |
Start bp | 1182093 |
End bp | 1184024 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639845470 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_992532 |
Protein GI | 121599133 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0705849 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGCCA ACCCCAAGTT CCTGTCGGCC GACGCCCGCG TCGACGCCGC CGCCGTCGCC CCGCTGCCGA ATTCGCGCAA GGTCTACGTG ACGGGCTCGC AACCCGACAT CCGCGTGCCG ATGCGTGAGA TCACGCAGGC CGATACGCCG ACGAGCTTCG GCGGCGAAAA GAACCCGCCG ATCTACGTCT ACGACACGTC GGGCCCGTAC ACGGACCCGG ACGCGAAGAT CGACATTCGC GCGGGCCTGC CCGCGCTGCG CCAGCGCTGG ATCGACGCGC GCGGCGACAC CGAGACGCTC TCGGGCCTCA CGAGCGACTA CGGCCGCGAG CGCGCGGCCG ATCCGGCGAC GGCCGAGCTG CGCTTTCCCG GCCTGCACCG TCATCCGCGC CGCGCGAAGG CGGGCAAGAA CGTCACGCAG ATGCACTACG CGCGCCAGGG CATCATCACA CCGGAAATGG AATACATCGC GATCCGCGAG AACCAGCGCC GCGCCGAGTA TCTGGAAAGC CTGAAGGCAA GCGGCCCGAA CGGCGCGAAG CTCGCCGCGA TGATGGGCCG CCAGCACGCG GGCCAGGCAT TCGGCGCCGC CGCGTTCGGC GCGAACGCGC CGGCCGAGAT CACGCCGGAG TTCGTGCGCG ACGAAGTCGC GCGCGGCCGC GCGATCATCC CGGCGAACAT CAACCACCCG GAAACCGAGC CGATGATCAT CGGCCGCAAC TTCCTCGTGA AGATCAACGC GAACATCGGC AATTCGGCCG TCACGTCGTC GATCGGCGAG GAAGTCGACA AGATGACGTG GGCGATCCGC TGGGGCGGCG ATACGGTGAT GGATCTGTCG ACCGGCAAGC ACATCCATGA AACGCGCGAG TGGATCATCC GCAACAGCCC GGTGCCGATC GGCACGGTGC CGATCTACCA GGCGCTGGAG AAGGTCAACG GCAAGGCCGA GGACCTGACC TGGGAAATCT TCCGCGACAC GCTGATCGAG CAAGCCGAGC AAGGCGTCGA CTACTTCACG ATCCACGCGG GCGTGCGCCT GCAATACGTG CCGCTCACCG CGAACCGGAT GACGGGCATC GTGTCGCGCG GCGGCTCGAT CATGGCGAAG TGGTGCCTCG CGCACCACAA GGAAAGCTTC CTGTACGAAC ACTTCGAAGA GATCTGCGAG ATCATGAAGG CGTACGACGT GAGCTTCTCG CTCGGCGACG GCCTGCGCCC CGGCTCGATC TACGACGCGA ACGACGAAGC GCAGCTCGGC GAACTGAAGA CGCTCGGCGA GCTCACGCAG ATCGCGTGGA AGCACGACGT GCAGGTGATG ATCGAAGGCC CCGGCCACGT GCCGATGCAG TTGATCAAGG AGAACATGGA TCTTCAGCTC GACTGGTGCA AGGAAGCGCC GTTCTACACG CTCGGGCCGC TGACCACCGA CATCGCGCCG GGCTACGACC ACATCACGTC GGGCATCGGC GCCGCGATGA TCGGCTGGTT CGGCACCGCG ATGCTGTGCT ACGTGACGCC GAAGGAGCAC CTCGGGCTGC CGAACAAGGA CGACGTGAAG GAAGGCATCA TCACGTACAA GCTCGCCGCG CACGCGGCCG ACCTGGCGAA GGGCCACCCG GGCGCGCAGG TGCGCGACAA CGCGCTGTCG AAGGCGCGCT TCGAGTTCCG CTGGCAAGAC CAGTTCAACC TGGGGCTCGA CCCGGACAAG GCGCGAGAAT TCCACGACGA AACGCTGCCG AAGGATTCGG CGAAGGTCGC GCATTTCTGC TCGATGTGCG GCCCGCACTT CTGCTCGATG AAGATCACGC AGGACGTGCG CGAGTTCGCC GCTCAGCAGG GCGTGTCAGA AAACGACGCG CTGAAGAAGG GGATGGAAGT GAAGGCGGTC GAGTTCGTGA AGAGCGGCTC GGAGATCTAT CACCGCCAGT GA
|
Protein sequence | MNANPKFLSA DARVDAAAVA PLPNSRKVYV TGSQPDIRVP MREITQADTP TSFGGEKNPP IYVYDTSGPY TDPDAKIDIR AGLPALRQRW IDARGDTETL SGLTSDYGRE RAADPATAEL RFPGLHRHPR RAKAGKNVTQ MHYARQGIIT PEMEYIAIRE NQRRAEYLES LKASGPNGAK LAAMMGRQHA GQAFGAAAFG ANAPAEITPE FVRDEVARGR AIIPANINHP ETEPMIIGRN FLVKINANIG NSAVTSSIGE EVDKMTWAIR WGGDTVMDLS TGKHIHETRE WIIRNSPVPI GTVPIYQALE KVNGKAEDLT WEIFRDTLIE QAEQGVDYFT IHAGVRLQYV PLTANRMTGI VSRGGSIMAK WCLAHHKESF LYEHFEEICE IMKAYDVSFS LGDGLRPGSI YDANDEAQLG ELKTLGELTQ IAWKHDVQVM IEGPGHVPMQ LIKENMDLQL DWCKEAPFYT LGPLTTDIAP GYDHITSGIG AAMIGWFGTA MLCYVTPKEH LGLPNKDDVK EGIITYKLAA HAADLAKGHP GAQVRDNALS KARFEFRWQD QFNLGLDPDK AREFHDETLP KDSAKVAHFC SMCGPHFCSM KITQDVREFA AQQGVSENDA LKKGMEVKAV EFVKSGSEIY HRQ
|
| |