Gene BAS4467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4467 
Symbol 
ID2851582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4378591 
End bp4379676 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content40% 
IMG OID637507704 
ProductM42 family peptidase 
Protein accessionYP_030714 
Protein GI49187462 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0619409 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAAAT TAGACGAGAC ATTGACAATG CTAAAAGAAT TAACAGATGC ACGCGGTATT 
GCGGGTAACG AGCGTGAACC ACGTGAAGTA ATGAAGAAAT ATATTGAGCC GTTTGCGGAC
GAACTTTCTA CTGATAATTT AGGAAGTTTA GTTGCGAAAA AAGTAGGGGA AGAAAACGGC
CCGAAAATTA TGGTTGCAGG TCATTTAGAT GAAGTTGGCT TTATGATTAC GCAAATTGAT
GACAAAGGTT TCCTTCGTTT CCAAACAGTT GGTGGCTGGT GGTCACAAGT TATGCTTGCG
CAGCGCGTGA CAATTGTAAC GCGTAAAGGA GATGTAACAG GTGTAATCGG TTCAAAACCA
CCGCATATCT TACCTCCAGA AGCGCGTAAA AAGCCAGTTG AAATTAAAGA CATGTTCATC
GATATTGGTG CTTCTAGCCA AGAAGAAGCA ATGGAGTGGG GCATACGACC AGGAGATCAA
GTTGTACCTT ACTTTGAATT CCAAGTGATG AAGAATGAAA AAATGTTACT TGCAAAAGCA
TGGGATAACC GAATTGGTTG TGCAATTGCA ATTGACGTAT TAAAACAATT AAAAAATGAA
AAGCATCCAA ACGTTGTATA CGGCGTAGGG ACTGTACAAG AAGAAGTCGG TCTTCGTGGT
GCGAAAACAT CTGCAAACTA TATTAAACCA GATATCGCAT TCGCAGTAGA TGTTGGTATC
GCTGGTGACA CACCGGGGGT AACGTCAAAA GAAGCGCAAA GTAAAATGGG TGATGGACCA
CAGATCATTT TATATGATGC TTCTGTTATC GGTCATACCG GTTTACGTGA CTTTGTAGTT
GATGTTGCTG ATGAATTACA AATTCCGTAT CAATATGATT CAGTAGCGGG CGGTGGAACG
GATGCAGGTG CAATTCATAT TGCTGTAAAT GGTATTCCGT CTATGGCAAT TACCATTGCA
ACGCGTTACA TTCATTCTCA TGCGGCAATG TTACACCGTG ATGACTATGA AAATGCAGTG
AAGTTAATTG TAGAAGTTAT TAAACGTCTT GATAAAGAGG CTGTACATAA CATTACATTT
AATTAA
 
Protein sequence
MTKLDETLTM LKELTDARGI AGNEREPREV MKKYIEPFAD ELSTDNLGSL VAKKVGEENG 
PKIMVAGHLD EVGFMITQID DKGFLRFQTV GGWWSQVMLA QRVTIVTRKG DVTGVIGSKP
PHILPPEARK KPVEIKDMFI DIGASSQEEA MEWGIRPGDQ VVPYFEFQVM KNEKMLLAKA
WDNRIGCAIA IDVLKQLKNE KHPNVVYGVG TVQEEVGLRG AKTSANYIKP DIAFAVDVGI
AGDTPGVTSK EAQSKMGDGP QIILYDASVI GHTGLRDFVV DVADELQIPY QYDSVAGGGT
DAGAIHIAVN GIPSMAITIA TRYIHSHAAM LHRDDYENAV KLIVEVIKRL DKEAVHNITF
N