Gene BAS4591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4591 
Symbol 
ID2850431 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4492721 
End bp4493794 
Gene Length1074 bp 
Protein Length357 aa 
Translation table11 
GC content41% 
IMG OID637507827 
ProductM42 family peptidase 
Protein accessionYP_030837 
Protein GI49187584 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAAAG AGACATTACA ATTATTTCGT ACGTTAACAG AATTACAAGG TGCATCAGGT 
TTTGAACATG ATGTGCGCCG TTTTATGAAG CAAGAATTAA GCAAATATGC AGATGAAATT
GTACAAGACG GTTTAGGTAG CGTATTTGGT CTGAAAAAAG GGGACGAAAC TGGCCCGCGT
GTTCTTGTAG CAGGTCATAT GGATGAAGTA GGTTTCATGA TTACGCAAAT TACGAAAAAC
GGAATGCTTC GTTTTCAACC GTTAGGCGGC TGGTGGAGCC AAGTACTATT AGCGCAACGC
GTACAAGTGA TGACGAAGAA TGGTCCTGTT ATTGGGGTTG TTGGTTCTAT CCCTCCTCAT
TTATTAAGTG ACGCGCAACG TGCAAAACCG ATGGATATAA AAAACATGTT AATTGATATA
GGTGCAGATA GCTATGAAGA TGCGATTGAA ATTGGTGTAA AACCAGGACA ACAAATCGTA
CCAATCTGCC CGTTTACGCC GATGGCAAAC GAAAAGAAAA TTATGGCGAA AGCTTGGGAC
AACCGTTACG GATGTGGTCT TGCAATCGAA TTACTAAAAG AATTAAAAGA CGAAACATTA
CCAAACACAT TATACTCTGG TGCGACTGTA CAAGAAGAAG TTGGTCTTCG CGGTGCACAA
ACTGCTGCAA ATATGATCCA ACCAGACATT TTCTATGCGC TTGATGCAAG TCCAGCAAAC
GATGCATCTG GTGACAAAAC ACAGTTCGGT CAATTAGGAA AAGGTGCTCT TCTTCGTATT
TACGATCGTA CGATGGTAAC ACATAGAGGA ATGCGTGAAT TCATTTTAGA TACAGCAGAA
ACAAACAACA TTCCGTACCA ATACTTTATT TCACAAGGTG GTACAGATGC GGGCCGTGTA
CATACAAGTA ACTCAGGTAT CCCATCAGCA GTAATTGGTG TTTGCGCACG TTACATTCAT
ACACATGCTT CTATTTTACA TGTTGATGAT TATGCAGCAG CGAAAGAGCT AATTACGAAG
CTTGTAAGAG CAACAGATAA AACGACGTTA GAGACAATTA AGAATAACGC GTAA
 
Protein sequence
MNKETLQLFR TLTELQGASG FEHDVRRFMK QELSKYADEI VQDGLGSVFG LKKGDETGPR 
VLVAGHMDEV GFMITQITKN GMLRFQPLGG WWSQVLLAQR VQVMTKNGPV IGVVGSIPPH
LLSDAQRAKP MDIKNMLIDI GADSYEDAIE IGVKPGQQIV PICPFTPMAN EKKIMAKAWD
NRYGCGLAIE LLKELKDETL PNTLYSGATV QEEVGLRGAQ TAANMIQPDI FYALDASPAN
DASGDKTQFG QLGKGALLRI YDRTMVTHRG MREFILDTAE TNNIPYQYFI SQGGTDAGRV
HTSNSGIPSA VIGVCARYIH THASILHVDD YAAAKELITK LVRATDKTTL ETIKNNA