Gene BCAH820_4639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_4639 
Symbol 
ID7190396 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp4394301 
End bp4395383 
Gene Length1083 bp 
Protein Length360 aa 
Translation table11 
GC content48% 
IMG OID643558049 
Productcollagen triple helix repeat domain protein 
Protein accessionYP_002453585 
Protein GI218905751 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones225 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCAT GGAGAAATAA TATAAATGGA TATTGTAGTT GTAATAATCA AAATGGTGTA 
CATGTTGATT CATGCTGTTT TAGTTGCGAT GGAACAGTTC CTAAGCTGGG TCCAACAGGT
CCGACGGGAG CAACGGGAGC AACAGGAGCA ACAGGAGTAA CGGGAGTAAC GGGAGCAACG
GGAGCAACGG GAGTAACAGG AGCGACAGGA ATAACGGGAG CGACAGGAAT AACAGGAGCA
ACAGGAATAA CGGGAGTAAC GGGAGCAACG GGAGCGACAG GAATAACAGG AGCAACGGGA
GCAACGGGAG CAACGGGAGC AACGGGAGCA ACAGGAGCAA CGGGAGCAAC GGGAGCAACA
GGAGCAACGG GAGCAACGGG TCCAACGGGA GCAACAGGTC CGACGGGAGC GACAGGAATA
ACGGGAGCAA CAGGAGCAAC AGGAGCAACA GGAGCAACGG GAGTAACGGG AGTAACAGGA
GCGACAGGAA TAACAGGAGC AACAGGAATA ACAGGAGCAA CAGGAATAAC GGGAGTAACG
GGTCCAACGG GAGCGACAGG AATAACAGGA GCAACGGGTC CAACGGGAGC GACAGGAATA
ACGGGAGCAA CAGGTCCAAC GGGAGCAACG GGTCCAACAG GAGAGATTGG TCCGACGGGA
GTCACAGGTA CAAGTATTAC GGCGACGTAT GCATTTGCAA ATAATACGTC AGGGACAGCG
ATATCGGTAC TTCTTGGTGG AACAAATGTC CCGCTTCCAA ATAATCAAAA TATCGGTCCG
GGGATTACTG TATCTGGGGG AAATACAGTA TTTACGGTTG CAAATGCTGG GAACTATTAT
ATTTCATATA CAATTAATAT AACGGCTTCG TTATTAGTCA GTTCGCGGAT TACAATTAAT
GGAGCGCCAC TTGCTGGGAC GATTAATTCT CCTGCGTTAG CAACTACATC ATTTAGTGCA
ACAATTATTA CTACTCTTGC GGCTGGAAGT GCTATTAGTT TGCAACTATT TGGCTTGTTA
GCTGTTGCAA CATTATCAAC AACTACACCA GGAGCTGTGT TAACGATAAT AAGGTTAAGT
TGA
 
Protein sequence
MSSWRNNING YCSCNNQNGV HVDSCCFSCD GTVPKLGPTG PTGATGATGA TGVTGVTGAT 
GATGVTGATG ITGATGITGA TGITGVTGAT GATGITGATG ATGATGATGA TGATGATGAT
GATGATGPTG ATGPTGATGI TGATGATGAT GATGVTGVTG ATGITGATGI TGATGITGVT
GPTGATGITG ATGPTGATGI TGATGPTGAT GPTGEIGPTG VTGTSITATY AFANNTSGTA
ISVLLGGTNV PLPNNQNIGP GITVSGGNTV FTVANAGNYY ISYTINITAS LLVSSRITIN
GAPLAGTINS PALATTSFSA TIITTLAAGS AISLQLFGLL AVATLSTTTP GAVLTIIRLS