Gene GBAA_4815 details


Gene Information

Locus tag: GBAA_4815
Symbol:
ID: 2815930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 4378029
End bp: 4379114
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 40%
IMG OID: 637791492
Product: M42 family peptidase
Protein accession: YP_021461
Protein GI: 47530112
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1363] Cellulase M and related proteins
TIGRFAM ID:
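
The coordinate and length fields in the record above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 4378029
END_BP = 4379114
GENE_LENGTH_BP = 1086
PROTEIN_LENGTH_AA = 361


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both values should agree with the record if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span works out to 1086 bp and the 362 codons give 361 residues plus the stop, which agrees with the listed gene and protein lengths.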


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000499602
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAAAAT TAGACGAGAC ATTGACAATG CTAAAAGAAT TAACAGATGC ACGCGGTATT 
GCGGGTAACG AGCGTGAACC ACGTGAAGTA ATGAAGAAAT ATATTGAGCC GTTTGCGGAC
GAACTTTCTA CTGATAATTT AGGAAGTTTA GTTGCGAAAA AAGTAGGGGA AGAAAACGGC
CCGAAAATTA TGGTTGCAGG TCATTTAGAT GAAGTTGGCT TTATGATTAC GCAAATTGAT
GACAAAGGTT TCCTTCGTTT CCAAACAGTT GGTGGCTGGT GGTCACAAGT TATGCTTGCG
CAGCGCGTGA CAATTGTAAC GCGTAAAGGA GATGTAACAG GTGTAATCGG TTCAAAACCA
CCGCATATCT TACCTCCAGA AGCGCGTAAA AAGCCAGTTG AAATTAAAGA CATGTTCATC
GATATTGGTG CTTCTAGCCA AGAAGAAGCA ATGGAGTGGG GCATACGACC AGGAGATCAA
GTTGTACCTT ACTTTGAATT CCAAGTGATG AAGAATGAAA AAATGTTACT TGCAAAAGCA
TGGGATAACC GAATTGGTTG TGCAATTGCA ATTGACGTAT TAAAACAATT AAAAAATGAA
AAGCATCCAA ACGTTGTATA CGGCGTAGGG ACTGTACAAG AAGAAGTCGG TCTTCGTGGT
GCGAAAACAT CTGCAAACTA TATTAAACCA GATATCGCAT TCGCAGTAGA TGTTGGTATC
GCTGGTGACA CACCGGGGGT AACGTCAAAA GAAGCGCAAA GTAAAATGGG TGATGGACCA
CAGATCATTT TATATGATGC TTCTGTTATC GGTCATACCG GTTTACGTGA CTTTGTAGTT
GATGTTGCTG ATGAATTACA AATTCCGTAT CAATATGATT CAGTAGCGGG CGGTGGAACG
GATGCAGGTG CAATTCATAT TGCTGTAAAT GGTATTCCGT CTATGGCAAT TACCATTGCA
ACGCGTTACA TTCATTCTCA TGCGGCAATG TTACACCGTG ATGACTATGA AAATGCAGTG
AAGTTAATTG TAGAAGTTAT TAAACGTCTT GATAAAGAGG CTGTACATAA CATTACATTT
AATTAA
 
Protein sequence
MTKLDETLTM LKELTDARGI AGNEREPREV MKKYIEPFAD ELSTDNLGSL VAKKVGEENG 
PKIMVAGHLD EVGFMITQID DKGFLRFQTV GGWWSQVMLA QRVTIVTRKG DVTGVIGSKP
PHILPPEARK KPVEIKDMFI DIGASSQEEA MEWGIRPGDQ VVPYFEFQVM KNEKMLLAKA
WDNRIGCAIA IDVLKQLKNE KHPNVVYGVG TVQEEVGLRG AKTSANYIKP DIAFAVDVGI
AGDTPGVTSK EAQSKMGDGP QIILYDASVI GHTGLRDFVV DVADELQIPY QYDSVAGGGT
DAGAIHIAVN GIPSMAITIA TRYIHSHAAM LHRDDYENAV KLIVEVIKRL DKEAVHNITF
N
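
The relationship between the two sequences above can be spot-checked by translating part of the gene sequence. The following is a minimal sketch, not the database's own method. It builds the codon table from the standard amino acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons), and it strips the spaces that separate the 10-base blocks. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G);
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as listed in the record.
FIRST_GENE_LINE = "ATGACAAAAT TAGACGAGAC ATTGACAATG CTAAAAGAAT TAACAGATGC ACGCGGTATT"
LAST_GENE_LINE = "AATTAA"

if __name__ == "__main__":
    print(translate(FIRST_GENE_LINE))  # MTKLDETLTMLKELTDARGI
    print(translate(LAST_GENE_LINE))   # N
```

The first 60 bases translate to the first 20 residues of the listed protein, and the last six bases give the final N followed by a TAA stop codon. Translating the last line on its own is in frame only because the 18 preceding lines hold 1080 bases, a multiple of three.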