Gene BAS1921 details


Gene Information

Locus tag: BAS1921
Symbol:
ID: 2852339
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 1937019
End bp: 1938173
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 36%
IMG OID: 637505171
Product: stage II sporulation protein P
Protein accession: YP_028184
Protein GI: 49184932
COG category:
COG ID:
TIGRFAM ID: [TIGR02867] stage II sporulation protein P
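
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1937019
end_bp = 1938173
gene_length_bp = 1155
protein_length_aa = 384

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
codons = span // 3
expected_protein = codons - 1

print(span, span % 3, expected_protein)  # prints: 1155 0 384
```

Under these assumptions the span matches the listed gene length, the length is a whole number of codons, and the residue count matches the listed protein length.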


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0118573
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATCGAG GCTTTTTTTA TGTGAAGTTC ACGAGTGTTC GTAAGTTAGT ATTATTTATT 
ATTGCTACAG TACTAGCGAC TTTTTTTCTT ATTAGTATGA TGGTAACCTC TATGAAAGAG
ACAAAGTCAA CGTATTTATA TAATTGGTTA AATGAGTTAT CAATGAATGG TTACATGTAC
GTTCTTGGAA AAGAGAATCA TTATTTTACA CAGGAATATC GAAATTTAAA TCAAGATTTT
TCAATATCTT CGTTTCTCTT TTCTATGGCT ACGAATATTC GTTTTAACGA TGTACGCAGT
TTTGTCGGCA AAGAGCTACC GGGTTTCGGT AAGTACGATA CAGAAATTGT TATTGCGGGT
GAAGGAACAA ATTATTCTAA CTTGCCTATA GAGTCGAGCG TTCCACTTGA AGAAGTAGTA
AAAGAACGGA CTGGAGAAGG TGGACAGGCT CCAAAGCCGG ATACGAATAA AGAGAAAAAG
CAACCAGCTC AAACGACAGG AAAACGACAA GTTGCATTTA TTTATCATTC GCATAGTTGG
GAATCTTATT TGCCGTTATT GAATTTAACA AATGATCCAA ATCCGAATAA AGCAACAAGT
TCCGTCACGA ATATTTCAAT AGTCGGTGAC CGATTTCGTG AACAATTAGC AAATGAAGGG
ATCGGAGCAA CTAACGACAA GACTGATGTT GGGCAAAAGT TGATTAGTAA AGGATTAAAT
AGCAATAGTT CTTATAAAAT GTCACGAGAA ATTGTACAAG AAGCAATGAC TAGCAATAAG
GAATTGCAGT ATTTTTTTGA TTTACATCGT GATAGTGCTC GGAAAAATGT AACGACAAAA
GCAATTGGAG ATAAATCATA TGCAAAGCTT GCTTTCGTAA TAGGGAAAGG TAATAAAAAT
TATGAAAAGA ATTTACAATT AGCAACGGCT TTACATGAGA CAATTAATAA GAAGTATCCA
GGAGTTAGCC GCGGTGTCAT TCAAAAAGGG TTCCAAACAG GCAATGGAGT CTATAATCAA
GATCTGTCAG GGCAAGCAAT ATTAATTGAA GTTGGTGGCG TAGATAATAC AGAGGAAGAA
CTAAATCGAT CGATTGATGT ACTTGCTAAA GCGTTTGGGG AATATTTCTG GCAGGCAGAA
AAGGTGAATG GATAA
 
Protein sequence
MNRGFFYVKF TSVRKLVLFI IATVLATFFL ISMMVTSMKE TKSTYLYNWL NELSMNGYMY 
VLGKENHYFT QEYRNLNQDF SISSFLFSMA TNIRFNDVRS FVGKELPGFG KYDTEIVIAG
EGTNYSNLPI ESSVPLEEVV KERTGEGGQA PKPDTNKEKK QPAQTTGKRQ VAFIYHSHSW
ESYLPLLNLT NDPNPNKATS SVTNISIVGD RFREQLANEG IGATNDKTDV GQKLISKGLN
SNSSYKMSRE IVQEAMTSNK ELQYFFDLHR DSARKNVTTK AIGDKSYAKL AFVIGKGNKN
YEKNLQLATA LHETINKKYP GVSRGVIQKG FQTGNGVYNQ DLSGQAILIE VGGVDNTEEE
LNRSIDVLAK AFGEYFWQAE KVNG
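
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is an illustrative sketch, not the tool that produced the record. It assumes the sequence is given on the coding strand in 10-base blocks separated by whitespace, and it uses the codon assignments of translation table 11, which match the standard genetic code for every codon (the two tables differ only in which codons may act as start codons). The function names `clean`, `translate`, and `gc_percent` are hypothetical helpers introduced here.

```python
# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional TCAG order for the first, second, and third base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGAATCGAG GCTTTTTTTA TGTGAAGTTC"))  # prints: MNRGFFYVKF
# Last 15 bases give the final residues, followed by the TAA stop codon.
print(translate("AAGGTGAATG GATAA"))  # prints: KVNG
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the listed protein sequence and GC content to be checked in the same way.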