Gene BCAH820_4686 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_4686 
Symbol 
ID7191194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp4439626 
End bp4440711 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content41% 
IMG OID643558096 
Productpeptidase, M42 family 
Protein accessionYP_002453632 
Protein GI218905798 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value6.124679999999999e-51 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACAAAAT TAGACGAGAC ATTGACAATG CTAAAAGAAT TAACAGATGC ACGCGGTATT 
GCGGGTAACG AGCGTGAACC ACGTGAAGTA ATGAAGAAAT ATATTGAGCC GTTTGCGGAC
GAACTTTCTA CTGATAATTT AGGAAGTTTA GTTGCGAAAA AAGTAGGGGA AGAAAACGGC
CCGAAAATTA TGGTTGCAGG TCATTTAGAT GAAGTTGGCT TTATGATCAC GCAAATTGAT
GACAAAGGTT TCCTTCGTTT CCAAACAGTT GGTGGCTGGT GGTCACAAGT TATGCTTGCG
CAGCGCGTGA CAATTGTAAC GCGTAAAGGA GATGTAACAG GTGTAATCGG TTCAAAACCA
CCGCATATCT TACCTCCAGA AGCGCGTAAA AAGCCAGTTG AAATTAAAGA CATGTTCATC
GATATTGGTG CTTCTAGCCA AGAAGAAGCA ATGGAGTGGG GCGTACGACC AGGAGATCAA
GTTGTACCTT ACTTTGAATT CCAAGTGATG AAGAATGAAA AAATGTTACT TGCAAAAGCA
TGGGATAACC GAATTGGTTG TGCAATTGCA ATTGACGTAT TAAAACAATT AAAAGATGAA
AAGCATCCAA ACGTTGTATA CGGCGTAGGG ACTGTACAAG AAGAAGTCGG TCTTCGTGGT
GCGAAAACAT CTGCAAACTA TATTAAACCA GATATCGCAT TCGCAGTAGA TGTTGGTATC
GCTGGTGACA CACCGGGGGT AACGTCAAAA GAAGCGCAAA GTAAAATGGG TGATGGACCA
CAGATCATTT TATATGATGC TTCTGTTATC GGTCATACCG GTTTACGTGA CTTTGTAGTT
GATGTTGCTG ATGAATTACA AATTCCGTAT CAATATGATT CAGTAGCGGG CGGTGGAACG
GATGCAGGTG CAATTCATAT TGCTGTAAAT GGTATTCCGT CTATGGCAAT TACCATTGCA
ACGCGTTACA TTCATTCTCA TGCGGCAATG TTACACCGTG ATGACTATGA AAACGCAGTG
AAGTTAATTG TAGAAGTTAT TAAACGTCTT GATAAAGAGG CTGTACATAA CATTACATTT
AATTAA
 
Protein sequence
MTKLDETLTM LKELTDARGI AGNEREPREV MKKYIEPFAD ELSTDNLGSL VAKKVGEENG 
PKIMVAGHLD EVGFMITQID DKGFLRFQTV GGWWSQVMLA QRVTIVTRKG DVTGVIGSKP
PHILPPEARK KPVEIKDMFI DIGASSQEEA MEWGVRPGDQ VVPYFEFQVM KNEKMLLAKA
WDNRIGCAIA IDVLKQLKDE KHPNVVYGVG TVQEEVGLRG AKTSANYIKP DIAFAVDVGI
AGDTPGVTSK EAQSKMGDGP QIILYDASVI GHTGLRDFVV DVADELQIPY QYDSVAGGGT
DAGAIHIAVN GIPSMAITIA TRYIHSHAAM LHRDDYENAV KLIVEVIKRL DKEAVHNITF
N