Gene Information |
Locus tag | BAS4467 |
Symbol | |
ID | 2851582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | - |
Start bp | 4378591 |
End bp | 4379676 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 637507704 |
Product | M42 family peptidase |
Protein accession | YP_030714 |
Protein GI | 49187462 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
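
The coordinate and length fields in this table are related by simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (which the reported values imply) and that the last codon of the CDS is a stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4378591, 4379676)
print(length, protein_length(length))  # 1086 361
```

Both results agree with the Gene Length and Protein Length fields.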
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0619409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAAAT TAGACGAGAC ATTGACAATG CTAAAAGAAT TAACAGATGC ACGCGGTATT GCGGGTAACG AGCGTGAACC ACGTGAAGTA ATGAAGAAAT ATATTGAGCC GTTTGCGGAC GAACTTTCTA CTGATAATTT AGGAAGTTTA GTTGCGAAAA AAGTAGGGGA AGAAAACGGC CCGAAAATTA TGGTTGCAGG TCATTTAGAT GAAGTTGGCT TTATGATTAC GCAAATTGAT GACAAAGGTT TCCTTCGTTT CCAAACAGTT GGTGGCTGGT GGTCACAAGT TATGCTTGCG CAGCGCGTGA CAATTGTAAC GCGTAAAGGA GATGTAACAG GTGTAATCGG TTCAAAACCA CCGCATATCT TACCTCCAGA AGCGCGTAAA AAGCCAGTTG AAATTAAAGA CATGTTCATC GATATTGGTG CTTCTAGCCA AGAAGAAGCA ATGGAGTGGG GCATACGACC AGGAGATCAA GTTGTACCTT ACTTTGAATT CCAAGTGATG AAGAATGAAA AAATGTTACT TGCAAAAGCA TGGGATAACC GAATTGGTTG TGCAATTGCA ATTGACGTAT TAAAACAATT AAAAAATGAA AAGCATCCAA ACGTTGTATA CGGCGTAGGG ACTGTACAAG AAGAAGTCGG TCTTCGTGGT GCGAAAACAT CTGCAAACTA TATTAAACCA GATATCGCAT TCGCAGTAGA TGTTGGTATC GCTGGTGACA CACCGGGGGT AACGTCAAAA GAAGCGCAAA GTAAAATGGG TGATGGACCA CAGATCATTT TATATGATGC TTCTGTTATC GGTCATACCG GTTTACGTGA CTTTGTAGTT GATGTTGCTG ATGAATTACA AATTCCGTAT CAATATGATT CAGTAGCGGG CGGTGGAACG GATGCAGGTG CAATTCATAT TGCTGTAAAT GGTATTCCGT CTATGGCAAT TACCATTGCA ACGCGTTACA TTCATTCTCA TGCGGCAATG TTACACCGTG ATGACTATGA AAATGCAGTG AAGTTAATTG TAGAAGTTAT TAAACGTCTT GATAAAGAGG CTGTACATAA CATTACATTT AATTAA
|
Protein sequence | MTKLDETLTM LKELTDARGI AGNEREPREV MKKYIEPFAD ELSTDNLGSL VAKKVGEENG PKIMVAGHLD EVGFMITQID DKGFLRFQTV GGWWSQVMLA QRVTIVTRKG DVTGVIGSKP PHILPPEARK KPVEIKDMFI DIGASSQEEA MEWGIRPGDQ VVPYFEFQVM KNEKMLLAKA WDNRIGCAIA IDVLKQLKNE KHPNVVYGVG TVQEEVGLRG AKTSANYIKP DIAFAVDVGI AGDTPGVTSK EAQSKMGDGP QIILYDASVI GHTGLRDFVV DVADELQIPY QYDSVAGGGT DAGAIHIAVN GIPSMAITIA TRYIHSHAAM LHRDDYENAV KLIVEVIKRL DKEAVHNITF N
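
The sequences in this section are displayed in space-separated blocks of ten characters. The Python sketch below shows one way such a field could be cleaned, its GC fraction computed, and its codons translated. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA although the gene lies on the minus strand). It also relies on translation table 11 assigning the same amino acids to codons as the standard code; the alternative start codons of table 11 are not handled, which does not matter here because the gene starts with ATG. The helper names are illustrative only.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0; trailing bases are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence from this record.
prefix = "ATGACAAAAT TAGACGAGAC ATTGACAATG"
print(translate(prefix))  # MTKLDETLTM
```

The translated prefix matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein fields to be checked in the same way.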