Gene BAS2561 details

Gene Information

Locus tag: BAS2561
Symbol:
ID: 2848392
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 2561288
End bp: 2562628
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 35%
IMG OID: 637505807
Product: hypothetical protein
Protein accession: YP_028820
Protein GI: 49185568
COG category:
COG ID:
TIGRFAM ID: [TIGR02889] germination protein YpeB
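
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2561288
end_bp = 2562628
gene_length_bp = 1341
protein_length_aa = 446


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Both checks compare a derived value against the value stated in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = coding_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions, 2562628 - 2561288 + 1 gives 1341 bp, and 1341 / 3 - 1 gives 446 aa, so the stated fields agree with each other.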


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000903794
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTACGAG GTATTATAAT CGTATTATTA ACAATCGGTG TAGTCGGCAC AGGGTACTGG 
GGCTATAAAG AGCATCAAGA GAAGAATGCA GTATTAATTC GGGCAGAAAA TAGCTATCAA
CGTGCGTTTC ACGATTTAGC GTACGAGGTA GATTTATTAC ACGATAAAAT TGGTACAACG
CTTGCGATGA ATTCACGTGC ATCTTTATCA CCTGCATTAG CAGATGTATG GAGATTAACA
TCTGAAGCTC GTTCAGATGT AGGGCAACTT CCTTTAACAT TAATGCCTTT TAATAAAACG
GAAGAATTTT TAGCGAATAT CGGTGATTTT AGTTACCGTG CAGCTATTCG TGATCTAGAA
AAAGAGCCGT TAAATGATCA AGAATACAAA ACGTTGCAAA CTTTATACTC AAATGCCGGA
AATATACAAG ATGAACTAAG AAAAGTGCAA CATCTTGTTT TGAAAAACAA TTTACGCTGG
ATGGATGTTG AGATGGCACT CGCATCCAAT CGTGATCCTG CCGACAATAC AATTATTGAC
GGACTAAAGA CAGTAGAAAA AAATGTAACA TCGTATTCAT CTACAAACTT CGGACCTACC
TTTACAAGTG CACAAAAAAA TAAAAAAGGT GGATTTGAAG CAGAAGGAAA AGCAATTTCA
AAAGATGAAG CGGGGAAAAT CGCAAAATCG TTCTTGAATT TAAAAGGAAA TGAAAAAGTA
GAAGTTGAGA AAAGTGGAAA AGGTGCAAAA GAATCTTTCT ATAGTGTGAA AATTAAAGAT
GAAGCAACGA ATAATAAGTT TTATATGGAT ATTACCGGAA AAGGCGGATA TCCAATTTGG
GTTATGAATA ATCGAGAAAT TAAAGAACAG AAGATTAGTT TAAATGACGC AGGAAGTAAA
GGTTTGAAAT TCTTAAAGGA CCATAAGTTT AATAATATGG AGCTTTATGA TAGCTCACAA
TATGATAATG TTGGAGTATT TACGTATGTA GTAAATGTGA ATGGCGTACG AATTTATCCT
GAAGCAATTC AAATGAAAAT TGCTTTAGAT GACGGTTCTA TCGTTGGATT CTCCGCAAAA
GAATATTTAG CGTCACATCA AAAACGAACA GTTCCATCAG CAAAACTAAC TGCAGCAGAA
GCAAGAAAGA AAATCAATCC AGATGTGAAA GTTATGGAAG AACGTAAAGC TGTCGTAGTA
AATGATCTGC ATAATGAAGT ACTTTGCTAT GAATTTGTAG GTACGTTAGG GAAAGATACG
TACCAAATCT TCATTAATGC AAATAGCGGA GCAGAAGAAA AAGTGAAGAA AATGCAGGCT
GTTGAAAAAA TTTATGATTA A
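
The gene sequence above is laid out in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the stated fields. The following is a minimal sketch of that cleanup. The helper names are hypothetical, and only the first row of the sequence is included as sample input.

```python
def clean_sequence(blocked):
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked):
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of 10 bases).
first_row = "ATGTTACGAG GTATTATAAT CGTATTATTA ACAATCGGTG TAGTCGGCAC AGGGTACTGG"

print(len(clean_sequence(first_row)))
print(round(gc_percent(first_row), 1))
```

Running `gc_percent` on the full sequence, pasted in the same blocked layout, would give a figure to compare against the GC content field.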
 
Protein sequence
MLRGIIIVLL TIGVVGTGYW GYKEHQEKNA VLIRAENSYQ RAFHDLAYEV DLLHDKIGTT 
LAMNSRASLS PALADVWRLT SEARSDVGQL PLTLMPFNKT EEFLANIGDF SYRAAIRDLE
KEPLNDQEYK TLQTLYSNAG NIQDELRKVQ HLVLKNNLRW MDVEMALASN RDPADNTIID
GLKTVEKNVT SYSSTNFGPT FTSAQKNKKG GFEAEGKAIS KDEAGKIAKS FLNLKGNEKV
EVEKSGKGAK ESFYSVKIKD EATNNKFYMD ITGKGGYPIW VMNNREIKEQ KISLNDAGSK
GLKFLKDHKF NNMELYDSSQ YDNVGVFTYV VNVNGVRIYP EAIQMKIALD DGSIVGFSAK
EYLASHQKRT VPSAKLTAAE ARKKINPDVK VMEERKAVVV NDLHNEVLCY EFVGTLGKDT
YQIFINANSG AEEKVKKMQA VEKIYD