Gene BAS1021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1021 
Symbol 
ID2848728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1073860 
End bp1074900 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content34% 
IMG OID637504280 
ProductS-layer protein 
Protein accessionYP_027294 
Protein GI49184042 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5386] Cell surface protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.347514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATAATGA TTAAGAAAAA ATATATGAAT GCATTCGTTA TAGCAGCAAC TTTAGCAGTT 
CCATTTAGTA GTATTATGGC ACCGATTGCG AAGGCAGAAG CGGCGGTTGA AATGAAAGCA
GCTAGCAAAT TAGCAGATGG CACTTATGAC GTTATTTTAA AGACTTATAA AGATAAAACG
AATGATACAT CAGTTGCGTC TACATATTTA AAAAATCTAA AAGTAACCAT TCAAGGTGAT
AAAAAAATCG TTACGTTAAC AGTTCAAGAT AGTAGCTATT TCCAGTATCT TAGAGTAGAA
GATACGAATA AAGTAGGGAC ATTCCATGAT GTAAAAGTAA TTTCCGAAGA TAAAGCAAAT
AACGGTACGA AAGTTGTTCA ATTTGAAATT GATGAGTTTT CGAAAAAATA TAATATGCAA
ATGCATATAT TAATTCCAGC AATTAAATAT GATCATAAAT ATCAAGTACA GTTTGAAATC
GACGCGAGTG CAATTGAACA GAAGCCTAAA TTCTCAGATG TACCAACTTG GGCACAAGAG
TCAGTTCAAT ATTTAGTAGA TAAAGAAGCA GTGCACGGTA AACCAGATGG TACATTTGCT
CCGGCTGAAA GTATCGATCG TAGTTCAGCT GCAAAAATAT TAGCAACTGT TTTACGGTTA
GAAATTAAGA AAGATGCAAA GCCATCATTC CCTGATGCAC AAAACCACTG GGCAACTCCA
TATATTGCTG CTGTTGAAAA AGCAGGTATT GTAAAAGGTG ATGAGAAGGG AAACTTTAAT
CCAAGCGGGT TAATTAACCG TGCATCAATG GCTTCTATGT TAGTAAATGC ATATAAATTA
GAAAGAAATG AAAATATAAA ACTACCGAAA GAATTTGCTG ACTTAAACAA TCATTGGGGT
GCGAAGTATG CCAATATTTT AATCCAAGAA AAGATTTCAA TTGGAACAGA TAATGGCTGG
GCTCCAAATA AAGCAGTAAG TCGTGCGGAA GCAGCACAAT TTATTGCGAA GGCGGATAAA
TTGAAGAAAG AAATGAAATA G
 
Protein sequence
MIMIKKKYMN AFVIAATLAV PFSSIMAPIA KAEAAVEMKA ASKLADGTYD VILKTYKDKT 
NDTSVASTYL KNLKVTIQGD KKIVTLTVQD SSYFQYLRVE DTNKVGTFHD VKVISEDKAN
NGTKVVQFEI DEFSKKYNMQ MHILIPAIKY DHKYQVQFEI DASAIEQKPK FSDVPTWAQE
SVQYLVDKEA VHGKPDGTFA PAESIDRSSA AKILATVLRL EIKKDAKPSF PDAQNHWATP
YIAAVEKAGI VKGDEKGNFN PSGLINRASM ASMLVNAYKL ERNENIKLPK EFADLNNHWG
AKYANILIQE KISIGTDNGW APNKAVSRAE AAQFIAKADK LKKEMK