Gene BAS5031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS5031 
Symbol 
ID2852090 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4902983 
End bp4904185 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content40% 
IMG OID637508286 
Producthypothetical protein 
Protein accessionYP_031270 
Protein GI49188017 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.322167 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGGTGA ATGGGTTCGT GGAGGCATGG ATTTTTGAAA TAATGCGGGC AGTTGGACGT 
TTTTTCTTAC ACCCTGCTGT CTATGTATTT TTAATAAGTA GTATCTTCGT TGGATACTTA
CGTATGTTAC GAGAACGAAA AGATTTTTCT TTTAAAGTTT ATGATATTTG GTTTGAACTG
CGAACAGCTT TATTTGCGGG GATTGGGTAT GGATTAGTAG TATCTATTAT TACGATTGGG
CTCGGACTTG TCGTTTCTAA AGCGAGCTTA TGGGCCATTT TACTTTGGAC ATTACTATTT
GGATTAACTG CTATGTACCG ATATTTATCA GCAGCTTATA CGTTTAGTAT CGCGATTGTA
TGTGTTCTAT TATCTTCTAA GCTACCAGTT TCCTTCTTAC AGCTTGGGGA AGGTGAAGAG
AATACAATTG TGTCCCTTGC TATTTTGCTA GGCATTATGC TCGTTGTAGA GGGCTTGTTG
ATTTCTAAAA ATGCAGTAGG ATATTCGACG CCGAAGATTA GGAAGGGTAA GCGTGGACTA
AAGATTGGTT TACACGAATC AAAGCGTTTA TGGATCATTC CTATTTTTAT TCTCGTACCA
GGTGACGCAG TAACGCAGTT TATTTCATGG TGGCCTGTCG TTTCAATCGG TTCTGATACA
TATTCCCTAT TCCTCGTTCC ATTTTTAATT GGATTTATGA GAAGGATTAG AAGTTATGAG
CCGACGGAAG CTTTATTATT TACAGGAAGA CGTGTGTACG GATTAGCAGG ACTTGTACTC
GTTTTAGGAA TCGCAAGTTA TTGGTGGCAC GTGCTTGCAA TTATCGCAAT GGGTGTTGCG
ATGCTTGGAC GATTCACGAT TTCCATGCAA GAGAAAATTT CTGATGAGAC AAGACCAGCG
TATTTCGCTG CACGTAATGA TGGACTCGTT GTATTAGATA CAATCCCGAA TACAATTGGG
GCAGAGCTGA ATTTACTACC CGGAGAAATG ATTACGAAAG TAAATGGAGT CATTCCAAGA
AGCGCTGAGG AATTTTATGA TGCGCTTCAA ACGAAGACGA CAGGAGCATT TTGTAAATTA
GAAGTATTAG ATACAAATGG TGAGCTTCGC CTTGCTCAAA CGGCATTATA CGCCGGAGGA
CATCATGAAC TAGGTATTGT ATTTGTTCAG CAGGAGCATG AGTGGGATTC GGAAGCGATG
TAA
 
Protein sequence
MVVNGFVEAW IFEIMRAVGR FFLHPAVYVF LISSIFVGYL RMLRERKDFS FKVYDIWFEL 
RTALFAGIGY GLVVSIITIG LGLVVSKASL WAILLWTLLF GLTAMYRYLS AAYTFSIAIV
CVLLSSKLPV SFLQLGEGEE NTIVSLAILL GIMLVVEGLL ISKNAVGYST PKIRKGKRGL
KIGLHESKRL WIIPIFILVP GDAVTQFISW WPVVSIGSDT YSLFLVPFLI GFMRRIRSYE
PTEALLFTGR RVYGLAGLVL VLGIASYWWH VLAIIAMGVA MLGRFTISMQ EKISDETRPA
YFAARNDGLV VLDTIPNTIG AELNLLPGEM ITKVNGVIPR SAEEFYDALQ TKTTGAFCKL
EVLDTNGELR LAQTALYAGG HHELGIVFVQ QEHEWDSEAM