Gene BCAH820_B0148 details


Gene Information

Locus tag: BCAH820_B0148
Symbol:
ID: 7169897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus AH820
Kingdom: Bacteria
Replicon accession: NC_011777
Strand:
Start bp: 123530
End bp: 125449
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 52%
IMG OID: 643559526
Product: collagen triple helix repeat protein
Protein accession: YP_002455030
Protein GI: 218848229
COG category:
COG ID:
TIGRFAM ID:
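
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 123530
end_bp = 125449
protein_length_aa = 639

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
coding_codons = codons - 1

print(gene_length_bp, remainder, coding_codons)  # 1920 0 639
```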


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 196
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATG AGAATGAAAA AAAGTATTCA AACGAATTAG CCCAAGCTGA TTTCATATCT 
GCTGCAGCGT TTGATCCTAG TCTTGTAGGC CCTACATTGC CGCCAACTCC ACCATTTACA
CTACCGACCG GTCCGACCGG AGCTACTGGT CCCACTGGAG CTACTGGTCC TACGGGAGCC
ACTGGTTCTA CCGGAGTTAC AGGTCCTACT GGAGTTACAG GTCCTACGGG AGCTACTGGT
CCTACGGGAG CCACTGGTTC TACCGGAGTT ACAGGTCCTA CGGGAGCCAC CGGTCCTAGC
GGAGCCACTG GTTCTACGGG AACTACAGGA CCTACCGGAG ATACAGGTCC TACGGGAATT
ACAGGACCTA CCGGAGTTAC AGGTCCTACG GGAGCTACTG GTCCTACGGG AGCTACTGGT
CCTACAGGAG CCACTGGTTC TACCGGAGTT ACAGGTCCTA CTGGAGTTAC AGGTCCTACG
GGAGCTACTG GTCCTACGGG AGCCACTGGT TCTACCGGAG TTACAGGTCC TACTGGAATT
ACAGGACCTA CTGGAGCTAC CGGAACAACT GGTTCTACAG GACCTACTGG AGTTACAGGT
CCTACTGGAG CTACTGGTCC TACGGGAGCC ACTGGTCCTA CGGGAGCCAC TGGTTCTACC
GGAGTTACAG GTCCTACTGG AATTACAGGA CCTACGGGAG CTACCGGAAC AACTGGTTCT
ACAGGACCTA CCGGAGTTAC AGGACCTACG GGAGCTACTG GTCCTACGGG AGCCACTGGT
CCTACAGGAG CCACTGGTTC TACCGGAGTT ACAGGTCCTA CGGGAGTTAC AGGTCCTACT
GGAGCTACTG GTCCTACGGG AGCCACTGGT TCTACCGGAG TTACAGGTCC TACTGGAGCT
ACTGGTCCTA CGGGAGCCAC TGGTTCTACC GGAGTTACAG GTCCTACGGG AGTTACAGGT
CCTACGGGAG CTACTGGTTC TACTGGAGCC ACCGGTCCTA CTGGAGCCAC TGGTTCTACC
GGAGTTACAG GTCCTACGGG AGCCACTGGT CCTACGGGAG CCACTGGTCC TACGGGAGCC
ACTGGTTCTA CCGGAGTTAC AGGTCCTACT GGAATTACAG GACCTACTGG AGCTACCGGA
ACAACTGGTT CTACAGGACC TACCGGAGTT ACAGGACCTA CGGGAGTTAC AGGTCCTACG
GGAGTTACAG GTCCTACGGG AGCTACTGGT CCTACGGGAG CCACTGGTTC TACCGGAGTT
ACAGGTCCTA CTGGAATTAC AGGACCTACT GGAGCTACCG GAACAACTGG TTCTACAGGA
CCTACCGGAG TTACAGGTCC TACGGGAGTT ACAGGACCTA CGGGAGCTAC TGGTCCTACG
GGAGTTACAG GACCTACCGG AGCTACAGGA CCTACCGGAG CTACTGGCGC TACAGCAACA
ACTTCTACAA AGGCTATCCT TTTTGGTGGT ACTAATGCGG GATTTCAGCG CATAGCAGGA
TCACCTGGTG CAGATTCACA AACGCTCCCT TATGTAACAG CAGGAGCTGG TAGTGTTGTT
GCGTTTTCTG CTTCTATAAA TGTAAATAAC TTAGGTACAG GTGTATATTT GTTGCGAGTA
TGTGATAATG TACCTACTAA TCTAGCTTCA CCGGGTGCTG GTCAAATAGT CTCTACAATT
ACGCTTACAC TTACAGCCAA TATTACTGGA ACTATAGTGT TTTCGATTAA ACCAACTGAT
ATTGGAGCAC AACCTGTAAA GGTATTTAAT CCTAATCCAG TGGTAGCCCC TGCAACAGTT
ACATGGACTA GTACAATACC TGGTAATCCA GTTGCAAGGA CAGATGCTAT CTCACTTTTT
ATAACTCCAG GAATTACTCA AAGTGCTGTA TACTCTGTAT TTATATCTAC TGCAGTTTAA
 
Protein sequence
MSDENEKKYS NELAQADFIS AAAFDPSLVG PTLPPTPPFT LPTGPTGATG PTGATGPTGA 
TGSTGVTGPT GVTGPTGATG PTGATGSTGV TGPTGATGPS GATGSTGTTG PTGDTGPTGI
TGPTGVTGPT GATGPTGATG PTGATGSTGV TGPTGVTGPT GATGPTGATG STGVTGPTGI
TGPTGATGTT GSTGPTGVTG PTGATGPTGA TGPTGATGST GVTGPTGITG PTGATGTTGS
TGPTGVTGPT GATGPTGATG PTGATGSTGV TGPTGVTGPT GATGPTGATG STGVTGPTGA
TGPTGATGST GVTGPTGVTG PTGATGSTGA TGPTGATGST GVTGPTGATG PTGATGPTGA
TGSTGVTGPT GITGPTGATG TTGSTGPTGV TGPTGVTGPT GVTGPTGATG PTGATGSTGV
TGPTGITGPT GATGTTGSTG PTGVTGPTGV TGPTGATGPT GVTGPTGATG PTGATGATAT
TSTKAILFGG TNAGFQRIAG SPGADSQTLP YVTAGAGSVV AFSASINVNN LGTGVYLLRV
CDNVPTNLAS PGAGQIVSTI TLTLTANITG TIVFSIKPTD IGAQPVKVFN PNPVVAPATV
TWTSTIPGNP VARTDAISLF ITPGITQSAV YSVFISTAV
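
As a rough illustration of how the protein sequence relates to the gene sequence, the sketch below translates the first 60 bases of the gene and reproduces the first 20 residues of the protein. The `translate` helper is hypothetical and not part of any database tooling. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the tables differ in which codons may act as start codons), and it does not handle ambiguous bases or alternative start codons.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above.
fragment = "ATGTCAGATG AGAATGAAAA AAAGTATTCA AACGAATTAG CCCAAGCTGA TTTCATATCT"
protein_prefix = translate(fragment)
print(protein_prefix)  # MSDENEKKYSNELAQADFIS
```

Passing the full gene sequence to the same helper should give the complete protein sequence, since every 60-base line of the listing starts on a codon boundary.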