Gene BAS5076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5076 
Symbol 
ID2850885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4952851 
End bp4954611 
Gene Length1761 bp 
Protein Length586 aa 
Translation table11 
GC content43% 
IMG OID637508331 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_031315 
Protein GI49188062 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGT CTGTTTCAGC TGAGCAAATT GAATTGAAAT CGAGTTTACC AGGAAGTAAG 
AAAGTGTATG TGGATGGACC ACGAGAAGGT ATGAAAGTGC CGATGCGTGA GATTGAACAA
AGTGATACAA ATGGCGTTCC AAATCCGCCA ATTCGTGTGT ATGATACAAG CGGTCCTTAC
ACAGATCCTG CGTATAAAGT CGAGTTAGAG AAGGGGATTC CAACGCCGCG CCACTCTTGG
ATTCTAGAGC GCGGAGATGT AGAGGCATAC GAAGGGCGCG AAGTGAAACC AGAGGATGAC
GGTGTGAAGG TGGCTTCGAA ACATACACCT GTTTTCCCGC AAATGGATCG CAAACCGCTT
AGAGCGAAGC AAGGTGCAAA TGTTACGCAA ATGCATTATG CACGTAATGG CATCATTAAG
TCTGAGATGG AATATGTTGC GATTCGTGAA GGAGTAGACC CGGAATTTGT TCGTAAGGAA
ATCGCGGAAG GTCGCGCTAT TTTACCAGCG AATATTAACC ATCCTGAAGC AGAACCGATG
ATTATTGGGC GTAATTTCCA TGTGAAGGTT AATGCGAATA TCGGAAACTC TGCTGTATCT
TCTTCTATTG CAGAAGAAGT AGAGAAGATG ACGTGGGCAA CTCGCTGGGG TGCAGATACG
ATTATGGATT TATCTACAGG TAAAAACATT CATACGACGC GCGAGTGGAT TATTCGTAAC
GCACCTGTAC CAGTTGGAAC TGTACCAATC TATCAAGCAC TGGAAAAAGT AAACGGAATT
GCAGAAGATT TAACGTGGGA AGTGTATCGT GATACGTTAA TTGAGCAAGC GGAGCAAGGC
GTAGATTACT TTACGATTCA CGCTGGCGTA TTACTTCGTT ACATTCCAAT TACGGCGAAA
CGTACGACAG GTATCGTTTC ACGCGGTGGT TCAATTATGG CACAGTGGTG TTTATTCCAT
CATAAAGAAA ACTTCCTATA CACTCATTTT GAAGAGATTT GTGAAATTAT GAAGCAGTAC
GATGTTTCGT TCTCTCTTGG AGATGGATTA CGTCCAGGTT CGATTGCAGA TGCAAATGAC
GAAGCACAGT TTTCTGAGCT TGAAACACTT GGTGAATTAA CGAAGATTGC TTGGAAACAT
GATGTGCAAG TGATGATTGA AGGGCCTGGG CATGTACCGA TGCATTTAAT TAAAGAGAAT
ATGGAGAAAG AACTTGATAT TTGTCAGGGC GCGCCGTTCT ATACACTTGG GCCGTTAACG
ACAGATATTG CACCAGGTTA TGACCATATT ACATCTGCGA TTGGAGCTGC GATGATTGGT
TGGTTTGGAA CGGCGATGCT TTGTTATGTA ACGCCGAAAG AACATTTAGG TTTACCAAAT
AAAGATGATG TTCGAGAAGG TGTTATTACG TACAAAATCG CTGCACATGC GGCTGATCTA
GCGAAAGGTC ACAAAACGGC TCATCAGCGT GATGATGCCC TTTCAAAAGC ACGCTTTGAA
TTCCGTTGGC GCGATCAATT TAATTTATCT TTAGATCCTG AACGCGCGAT GGAGTATCAC
GATGAAACAT TGCCAGCAGA AGGAGCGAAA ACGGCTCATT TCTGTTCCAT GTGTGGACCG
AAGTTTTGTA GTATGAGAAT TTCACATGAT ATTCGTGAAT ACGCAAAAGA AAATGATTTA
GAAACGACAG AAGCAATTGA AAAAGGAATG AAAGAGAAAG CGAAAGAATT TAAAGAAACT
GGTAGTCATT TATACCAATA A
 
Protein sequence
MKQSVSAEQI ELKSSLPGSK KVYVDGPREG MKVPMREIEQ SDTNGVPNPP IRVYDTSGPY 
TDPAYKVELE KGIPTPRHSW ILERGDVEAY EGREVKPEDD GVKVASKHTP VFPQMDRKPL
RAKQGANVTQ MHYARNGIIK SEMEYVAIRE GVDPEFVRKE IAEGRAILPA NINHPEAEPM
IIGRNFHVKV NANIGNSAVS SSIAEEVEKM TWATRWGADT IMDLSTGKNI HTTREWIIRN
APVPVGTVPI YQALEKVNGI AEDLTWEVYR DTLIEQAEQG VDYFTIHAGV LLRYIPITAK
RTTGIVSRGG SIMAQWCLFH HKENFLYTHF EEICEIMKQY DVSFSLGDGL RPGSIADAND
EAQFSELETL GELTKIAWKH DVQVMIEGPG HVPMHLIKEN MEKELDICQG APFYTLGPLT
TDIAPGYDHI TSAIGAAMIG WFGTAMLCYV TPKEHLGLPN KDDVREGVIT YKIAAHAADL
AKGHKTAHQR DDALSKARFE FRWRDQFNLS LDPERAMEYH DETLPAEGAK TAHFCSMCGP
KFCSMRISHD IREYAKENDL ETTEAIEKGM KEKAKEFKET GSHLYQ