Gene BAS4435 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4435 
Symbol 
ID2851621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4343860 
End bp4345059 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content34% 
IMG OID637507672 
Producthypothetical protein 
Protein accessionYP_030682 
Protein GI49187430 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATCAG AAGTAACTGT AAAAATAAAA CCGAAATTTA TAAAAGAAAT TAAAAGTGGA 
TATCCGCTTA TTTTAAAAGA TGCGATTCAA AATTTAAATG ATGTTCAGGA AGAAGGAACA
ATCATTAAAG TGGTAGATGA GAAGAACCAC TTTGTCGGAA AAGGTTATTA TGGAAAACAA
AATAAAGGAT ACGGTTGGAT TTTAACGAGA AAAGAGAGTG AACAAATTAA TCAATCTTTC
TTTGAAAGTA AAATAAAATC TGCCTTACAT AAACGAAAGC AATTTTACAA ATCAAGTGAT
ACGACAGCAT TCCGCGCCCT AAACGGTGAA GGTGATGGTC TTGGTGGCTT AATCATCGAT
TATTATGACG GTTATTATGT AGTGAGCTGG TATAGCGAAG GGATTTATAC TTTCAGAGAT
GAGATTATAG CTGCTCTCCA AAAAGTAGCA AACTTTAAAG GTATTTATGA GAAGAAGCGT
TTTGATACGA AAGGGAAATA CATTGAAGGC GATGATTTCG TAGCAGGAGA GCGCGGTGAG
TTCCCGCTTA TCGTAAAAGA GAATGGTGTG AACTTCGCTG TTTATTTAAA TGACGGAGCG
ATGGTTGGTG TCTTTTTAGA TCAACGTAAC GTGCGAAAAC AAATTCGTGA TAAATATGCG
AAGGGAAGAA CCGTGTTAAA TATGTTCTCT TATACAGGTG CTTTTTCTGT ATTTGCAGCG
CTTGGTGGGG CGAGTAAAAC GACGAGTGTC GACCTTGCAA ATCGTAGTTT AAGTAAAACA
ATTGAGCAGT TTAGTGTAAA TGAAATTGAT TATGAAGCAC AAGATATTAT CGTAGAAGAT
GTATTCCTTT ACTTTAAATA TGCAGCGAAG AAAAAGATGA AGTTTGATAT GGTTGTATTA
GACCCTCCAA GCTTTGCACG CTCGAAAAAA TATACATTTA GTGCAGCAAA AGATTATAAA
AATTTATTAA AAGAAACAAT TGCCATTACA GAAAATAACG GTATTATCGT TGCTTCTACA
AATTGTAGTG CATTCGATAT GAAAAAGTTT AAAGGCTTTA TTGATACAGC ATTTAAAGAA
ATGAATGGTA AATATAAAGT ATTAGAAGAA CATTCTTTAC CAGAAGATTT CCGTACAATC
GATCAATTTA AAGAAGGAGA CTATTTAAAA GTAGTTTTCA TCGAGAAAAT TAAAGGGTAA
 
Protein sequence
MRSEVTVKIK PKFIKEIKSG YPLILKDAIQ NLNDVQEEGT IIKVVDEKNH FVGKGYYGKQ 
NKGYGWILTR KESEQINQSF FESKIKSALH KRKQFYKSSD TTAFRALNGE GDGLGGLIID
YYDGYYVVSW YSEGIYTFRD EIIAALQKVA NFKGIYEKKR FDTKGKYIEG DDFVAGERGE
FPLIVKENGV NFAVYLNDGA MVGVFLDQRN VRKQIRDKYA KGRTVLNMFS YTGAFSVFAA
LGGASKTTSV DLANRSLSKT IEQFSVNEID YEAQDIIVED VFLYFKYAAK KKMKFDMVVL
DPPSFARSKK YTFSAAKDYK NLLKETIAIT ENNGIIVAST NCSAFDMKKF KGFIDTAFKE
MNGKYKVLEE HSLPEDFRTI DQFKEGDYLK VVFIEKIKG