Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH820_B0148 |
Symbol | |
ID | 7169897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH820 |
Kingdom | Bacteria |
Replicon accession | NC_011777 |
Strand | - |
Start bp | 123530 |
End bp | 125449 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643559526 |
Product | collagen triple helix repeat protein |
Protein accession | YP_002455030 |
Protein GI | 218848229 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 196 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGATG AGAATGAAAA AAAGTATTCA AACGAATTAG CCCAAGCTGA TTTCATATCT GCTGCAGCGT TTGATCCTAG TCTTGTAGGC CCTACATTGC CGCCAACTCC ACCATTTACA CTACCGACCG GTCCGACCGG AGCTACTGGT CCCACTGGAG CTACTGGTCC TACGGGAGCC ACTGGTTCTA CCGGAGTTAC AGGTCCTACT GGAGTTACAG GTCCTACGGG AGCTACTGGT CCTACGGGAG CCACTGGTTC TACCGGAGTT ACAGGTCCTA CGGGAGCCAC CGGTCCTAGC GGAGCCACTG GTTCTACGGG AACTACAGGA CCTACCGGAG ATACAGGTCC TACGGGAATT ACAGGACCTA CCGGAGTTAC AGGTCCTACG GGAGCTACTG GTCCTACGGG AGCTACTGGT CCTACAGGAG CCACTGGTTC TACCGGAGTT ACAGGTCCTA CTGGAGTTAC AGGTCCTACG GGAGCTACTG GTCCTACGGG AGCCACTGGT TCTACCGGAG TTACAGGTCC TACTGGAATT ACAGGACCTA CTGGAGCTAC CGGAACAACT GGTTCTACAG GACCTACTGG AGTTACAGGT CCTACTGGAG CTACTGGTCC TACGGGAGCC ACTGGTCCTA CGGGAGCCAC TGGTTCTACC GGAGTTACAG GTCCTACTGG AATTACAGGA CCTACGGGAG CTACCGGAAC AACTGGTTCT ACAGGACCTA CCGGAGTTAC AGGACCTACG GGAGCTACTG GTCCTACGGG AGCCACTGGT CCTACAGGAG CCACTGGTTC TACCGGAGTT ACAGGTCCTA CGGGAGTTAC AGGTCCTACT GGAGCTACTG GTCCTACGGG AGCCACTGGT TCTACCGGAG TTACAGGTCC TACTGGAGCT ACTGGTCCTA CGGGAGCCAC TGGTTCTACC GGAGTTACAG GTCCTACGGG AGTTACAGGT CCTACGGGAG CTACTGGTTC TACTGGAGCC ACCGGTCCTA CTGGAGCCAC TGGTTCTACC GGAGTTACAG GTCCTACGGG AGCCACTGGT CCTACGGGAG CCACTGGTCC TACGGGAGCC ACTGGTTCTA CCGGAGTTAC AGGTCCTACT GGAATTACAG GACCTACTGG AGCTACCGGA ACAACTGGTT CTACAGGACC TACCGGAGTT ACAGGACCTA CGGGAGTTAC AGGTCCTACG GGAGTTACAG GTCCTACGGG AGCTACTGGT CCTACGGGAG CCACTGGTTC TACCGGAGTT ACAGGTCCTA CTGGAATTAC AGGACCTACT GGAGCTACCG GAACAACTGG TTCTACAGGA CCTACCGGAG TTACAGGTCC TACGGGAGTT ACAGGACCTA CGGGAGCTAC TGGTCCTACG GGAGTTACAG GACCTACCGG AGCTACAGGA CCTACCGGAG CTACTGGCGC TACAGCAACA ACTTCTACAA AGGCTATCCT TTTTGGTGGT ACTAATGCGG GATTTCAGCG CATAGCAGGA TCACCTGGTG CAGATTCACA AACGCTCCCT TATGTAACAG CAGGAGCTGG TAGTGTTGTT GCGTTTTCTG CTTCTATAAA TGTAAATAAC TTAGGTACAG GTGTATATTT GTTGCGAGTA TGTGATAATG TACCTACTAA TCTAGCTTCA CCGGGTGCTG GTCAAATAGT CTCTACAATT ACGCTTACAC TTACAGCCAA TATTACTGGA ACTATAGTGT TTTCGATTAA ACCAACTGAT ATTGGAGCAC AACCTGTAAA GGTATTTAAT CCTAATCCAG TGGTAGCCCC TGCAACAGTT ACATGGACTA GTACAATACC TGGTAATCCA GTTGCAAGGA CAGATGCTAT CTCACTTTTT ATAACTCCAG GAATTACTCA AAGTGCTGTA TACTCTGTAT TTATATCTAC TGCAGTTTAA
|
Protein sequence | MSDENEKKYS NELAQADFIS AAAFDPSLVG PTLPPTPPFT LPTGPTGATG PTGATGPTGA TGSTGVTGPT GVTGPTGATG PTGATGSTGV TGPTGATGPS GATGSTGTTG PTGDTGPTGI TGPTGVTGPT GATGPTGATG PTGATGSTGV TGPTGVTGPT GATGPTGATG STGVTGPTGI TGPTGATGTT GSTGPTGVTG PTGATGPTGA TGPTGATGST GVTGPTGITG PTGATGTTGS TGPTGVTGPT GATGPTGATG PTGATGSTGV TGPTGVTGPT GATGPTGATG STGVTGPTGA TGPTGATGST GVTGPTGVTG PTGATGSTGA TGPTGATGST GVTGPTGATG PTGATGPTGA TGSTGVTGPT GITGPTGATG TTGSTGPTGV TGPTGVTGPT GVTGPTGATG PTGATGSTGV TGPTGITGPT GATGTTGSTG PTGVTGPTGV TGPTGATGPT GVTGPTGATG PTGATGATAT TSTKAILFGG TNAGFQRIAG SPGADSQTLP YVTAGAGSVV AFSASINVNN LGTGVYLLRV CDNVPTNLAS PGAGQIVSTI TLTLTANITG TIVFSIKPTD IGAQPVKVFN PNPVVAPATV TWTSTIPGNP VARTDAISLF ITPGITQSAV YSVFISTAV
|
| |