Gene BAS3050 details


Gene Information

Locus tag: BAS3050
Symbol:
ID: 2849560
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3021944
End bp: 3023059
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 33%
IMG OID: 637506294
Product: TPR domain-containing protein
Protein accession: YP_029307
Protein GI: 49186055
COG category:
COG ID:
TIGRFAM ID:
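
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(3021944, 3023059)
print(length)                  # 1116
print(protein_length(length))  # 371
```

Both values agree with the Gene Length and Protein Length fields of this record.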


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.382615
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGTGTTT CAGTAAAGGG AAATGAGCAA TTAACCTCTC TATTGAACGA CTGGTATCGA 
TCTATGCTAT CTCAACAAGT TGTAAAAGCT ACTAATCTAA AAAAGAAAAT TGATGAAAAA
ATTACTAAGT TAAGCATTGA GTCAAATCAA GAACGTCAGG ATCAAAATTT GTTACTATAT
TACTCACTAC TTGAATTTCG TTACACAGTC TTAACAGATA GCCTCGGTAT TCAACAAAAT
AGTTTTGATG CTATTAGTGA TTACGATATG CCTACAGACC ATTTTCTACG CTTCTATTAT
CACTTTTTTA AATCCATTCA TTCCACTTTT ATATCAAGTT TTACTGAGGC AGAGGAACAT
TATAAACTGG CAGAAAAGAT ACTAGTAAAC ATTCCAGATG AGATTGAACA TGCTGAATTC
TACTATAGAA TTGCTACTTT TTATCATCAT ACCTATAACA TGCTCGCTTC TATCGAATAC
GCAAATAAAT CGAAAGCAAT CTTTTCGAAG TATGAAGGTT ATGAAGTAAA AACAGCCTTT
TGTAATAGTT TGTTAGGGGG TTGCTGTATC TATTTAAAGC AGTACGAACA AGCAGAAGAA
TATCTACATT GTGCAATTGA ATTACTACAG AAGAATAAGG AAGAAGATTC CTTGTTATAC
GTAAAAAGTA CTATGGGGTG GCTGTATTCT GATCAAAGTA TGTCTACGTT AGCTATTCGT
CACCTTTCAG AAGTAACAGA AAAAATCCCT ACACACTTCA AAGCTATCTT CCTACAAGCT
AAGGAACATT ATAAATTAGG AGAACAATTA GCAGCTAGCA AACTCATTGA TAAGGGATTA
CAGATCTGCA GAGGAATTCA TAACGAAGAA TACACACACC ACTTCTCTAT TTTAAAAAGA
TTAAATGAAA ATATTCCACT AGAAGAATTA GAAAATATCA TTCAAGAAGG AATCTTATAC
TTTGAGCAAG AAGAATTATG GGAATATGTT GTCGAATACG CCGAATTATT TGCCACAAAA
TGTAGACAAT TTGAGAACCA CCAAAAGGTA AGTGATTATT TCCATATTTG TTATCAAGCA
AGACGAAAAT CAATCGAAAA AGGAGTGTTA AAATAA
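
The gene sequence above is displayed in 10-base blocks, so the spaces and line breaks have to be stripped before any computation. As a rough sketch, GC content such as the value in the GC content field could be computed as follows. The helper names are hypothetical, and rounding to a whole percent is an assumption about how the record reports it.

```python
def clean_sequence(text: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C in a block-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Paste the full gene sequence here to check it against the GC content field.
first_line = "GTGAGTGTTT CAGTAAAGGG AAATGAGCAA TTAACCTCTC TATTGAACGA CTGGTATCGA"
print(f"{gc_fraction(first_line):.0%}")
```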
 
Protein sequence
MSVSVKGNEQ LTSLLNDWYR SMLSQQVVKA TNLKKKIDEK ITKLSIESNQ ERQDQNLLLY 
YSLLEFRYTV LTDSLGIQQN SFDAISDYDM PTDHFLRFYY HFFKSIHSTF ISSFTEAEEH
YKLAEKILVN IPDEIEHAEF YYRIATFYHH TYNMLASIEY ANKSKAIFSK YEGYEVKTAF
CNSLLGGCCI YLKQYEQAEE YLHCAIELLQ KNKEEDSLLY VKSTMGWLYS DQSMSTLAIR
HLSEVTEKIP THFKAIFLQA KEHYKLGEQL AASKLIDKGL QICRGIHNEE YTHHFSILKR
LNENIPLEEL ENIIQEGILY FEQEELWEYV VEYAELFATK CRQFENHQKV SDYFHICYQA
RRKSIEKGVL K
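
The protein sequence can be checked against the gene sequence by translating the CDS. The sketch below is a minimal illustration, not the database's own method. It relies on translation table 11 using the same amino acid assignments as the standard code while allowing alternative start codons such as GTG, which are read as methionine at the first position. The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid start codon as M and stopping at the first stop."""
    seq = "".join(dna.split()).upper()  # strip the 10-base display spacing
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is always read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAGTGTTT CAGTAAAGGG AAATGAGCAA"))  # MSVSVKGNEQ
```

The output matches the first ten residues of the protein sequence, including the GTG start codon read as M.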