Gene BAS1899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1899 
Symbol 
ID2851913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1918966 
End bp1920072 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content31% 
IMG OID637505150 
Productspore coat protein H 
Protein accessionYP_028163 
Protein GI49184911 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5337] Spore coat assembly protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGAA CTGAGAAGGG ATGTGAAAAT ATGCTACCTT CATATGATTT TTTTATTCAT 
CCAATGTACG TAGTGGAATT GAAAAAAGAC ATTTGGTCAG ACAGTCCAGT ACCAGCAAAA
TTAACTTATG GAAAAAAGAA GTATGATATT GATATCGTAT ATCGGGGTGC TCATATTCGT
GAATTTGAGA AAAAGTCTTA TCATGTTATG TTTTATAAGC CAAAAAAATT TCAAGGTGCG
AAAGAGTTTC ATTTAAATTC TGAGTTTATG GATCCGTCTC TCATACGAAA TAAATTATCT
TTAGATTTTT TTCATGATAT TGGTGTACAT TCACCAAAAT CACAACATGT ATTTATAAAA
ATTAATGGTC AAATTCAAGG AGTATATTTA CAGTTAGAAT CAGTTGATGA AAACTTTTTG
AAAAATAGAG GATTACCTAG TGGTTCTATT TATTATGCGA TAGATGATGA TGCGAATTTC
TCTTTAATGA GTGAAAGAGA TAAAGATGTT AAGACTGAGC TTTTTGCGGG TTATGAATTT
AAATATTCGA ATGAACATAG TGAAGAACAA TTGAGTGAAT TTGTATTTCA AGCGAACGCT
TTGTCGAGGG AAGCGTATGA AAAAGAAATT GGGAAGTTTC TAAATGTTGA TAAGTATTTA
CGATGGTTAG CAGGCGTTAT TTTTACACAA AACTTTGATG GTTTTGTTCA TAACTATGCA
TTATACCATA ACGATGAAAC AAATTTATTT GAAGTGATAC CGTGGGATTA TGATGCGACT
TGGGGGCGTG ATGTACAAGG GAGACCGCTT AATCATGAAT ATATTCGTAT TCAAGGTTAT
AACACGTTAA GTGCAAGATT GTTAGATATA CCTGTATTTA GAAAACAATA CCGAAGTATT
TTGGAAGAAA TATTAGAAGA ACAATTTACG GTTTCATTTA TGATGCCGAA AGTAGAAAGT
TTATGTGAAG CAATACGTCC TTATTTACTA CAAGATCCAT ATATGAAAGA AAAATTAGAA
ACCTTTGATC AAGAACCTGG TGTGATTGAG GAATATATAA ATAAAAGAAG AAAGTATATA
CAAGATCATT TACATGAATT GGATTAA
 
Protein sequence
MKRTEKGCEN MLPSYDFFIH PMYVVELKKD IWSDSPVPAK LTYGKKKYDI DIVYRGAHIR 
EFEKKSYHVM FYKPKKFQGA KEFHLNSEFM DPSLIRNKLS LDFFHDIGVH SPKSQHVFIK
INGQIQGVYL QLESVDENFL KNRGLPSGSI YYAIDDDANF SLMSERDKDV KTELFAGYEF
KYSNEHSEEQ LSEFVFQANA LSREAYEKEI GKFLNVDKYL RWLAGVIFTQ NFDGFVHNYA
LYHNDETNLF EVIPWDYDAT WGRDVQGRPL NHEYIRIQGY NTLSARLLDI PVFRKQYRSI
LEEILEEQFT VSFMMPKVES LCEAIRPYLL QDPYMKEKLE TFDQEPGVIE EYINKRRKYI
QDHLHELD