Gene BCAH820_4844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_4844 
Symbol 
ID7190800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp4581226 
End bp4584885 
Gene Length3660 bp 
Protein Length1219 aa 
Translation table11 
GC content55% 
IMG OID643558254 
Productcollagen triple helix repeat domain protein 
Protein accessionYP_002453790 
Protein GI218905956 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones325 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAAGGAA ATGGTGGTAA ATCCAAAATA AAAAGTCCAT TAAATTCTAA TTTCAAGATA 
TTGTCAGATC TAGTTGGCCC TACTTTTCCT CCAGTTCCAA CTGGAATGAC AGGGATAACG
GGAAGTACGG GAGCGACGGG AACCACGGGA GCAACCGGAG AAACAGGTCC AACGGGAAGT
ACGGGAGCAA CGGGAACCAC GGGAGCAACC GGAGAAACAG GTCCAACGGG AAGTACGGGA
GCAACGGGAA CCACGGGAGC AACCGGAGAA ACAGGTCCAA CGGGAAGTAC GGGAGCAACA
GGAAGCACCG GGGTAACTGG AAATACGGGT TCAACGGGAA GTACGGGAGC AACGGGAACC
ACGGGAGCAA CCGGAGAAAC AGGTCCAACG GGAAGTACGG GAGCAACAGG AAGCACCGGG
GTAACTGGAA ATACGGGTTC AACGGGAAGT ACGGGAGCGA CGGGAACCAC GGGAGCAACC
GGAGAAACGG GAGCGACGGG GCCAACGGGA AGCATCGGAG CAATGGGAAC CACGGGAGCA
ACCGGAGAAA CGGGATCGAC AGGAAACACG GGTCCAACGG GAAGTACTGG GGTAACAGGA
AGTACGGGTC CAACAGGAGA AACGGGAGCA ACAGGAGAAA CGGGATCGAC AGGAAACACG
GGTCCAACGG GAGAAACGGG AGCAACAGGA AGTACTGGGG TAACAGGAAA CACGGGTCCA
ACAGGAGAAA CGGGAGCAAC AGGAAGTACG GGTCCAACAG GAAATACGGG AGCGACGGGA
AATACCGGTC CGACAGGAGA AACGGGAGCA ACAGGAAGTA CTGGGGTAAC AGGAAACACG
GGAGCAACCG GAGCAACAGG TCCAACGGGA AGTACGGGAG CAACGGGAAG TACGGGTCCA
ACGGGAGAAA CGGGAGTGAC AGGAAATACG GGTCCAACAG GAGAAACGGG AGTAACAGGA
AGCACCGGGG TAACAGGAAA TACGGGTCCA ACAGGAGAAA CGGGAGTAAC AGGAAGCACC
GGGGTAACAG GAAATACGGG TCCAACAGGA AGCACCGGAG CAACGGGAAA CACGGGTCCA
ACAGGAGAAA CGGGAGCAAC AGGAAGTACT GGGGTAACAG GAAGTACGGG AGCGACAGGA
AACACGGGAG CAACGGGAAA TACAGGAGCA ACAGGAAGTA CGGGTCCAAC AGGAAATACG
GGAGCGACGG GAAATACCGG TCCGACAGGA AACACGGGAG CAACAGGAGT AACAGGAAGC
ACCGGGGTAA CAGGAAACAC GGGTCCAACC GGAGCAACAG GTCCAACGGG AAGTACGGGA
GCAACGGGAA GTACGGGTCC AACGGGAGAA ACGGGAGTGA CAGGAAATAC GGGTCCAACA
GGAGAAACGG GAGTAACAGG AAGCACCGGG GTAACAGGAA ATACGGGTCC AACAGGAAGC
ACCGGAGCAA CGGGAAACAC GGGTCCAACA GGAGAAACGG GAGCAACAGG AAGTACTGGG
GTAACAGGAA GTACGGGAGC AACAGGAAGT ACTGGGGTAA CAGGAAGTAC GGGAGCGACA
GGAAACACGG GAGCAACGGG AAATACAGGA GCAACAGGAA GTACGGGTCC AACAGGAAAT
ACGGGAGCGA CGGGAAATAC GGGTCCAACA GGAAGCACGG GAGCGACGGG AAATACGGGT
CCAACAGGAA GCACGGGAGC AACAGGAGTA ACAGGAAGCA CCGGAGCAAC GGGAAACACG
GGTCCAACAG GAGAAACGGG AGCAACAGGA AGTACTGGGG TAACAGGAAG TACGGGAGCA
ACAGGAAGTA CTGGGGTAAC AGGAAGTACG GGAGCAACGG GAAATACAGG AGCAACAGGA
AGTACGGGTC CAACAGGAAA TACGGGAGCG ACGGGAAGCA CAGGTCCGAC CGGAAGCACC
GGAGCAACGG GAAACACGGG TCCAACGGGA AGTACGGGAG AAACGGGAGC GACGGGAAAT
ACCGGTCCAA CGGGAGAAAC GGGAGTGACA GGAAGTACGG GTCCAACCGG AAACACGGGA
GCAACCGGAG AAACGGGAGT GACAGGTCCG ACCGGAAGTA CGGGAGTGAC GGGAAGCACA
GGTCCGACCG GAAGCACCGG AGCAACGGGA AACACGGGAG CAACCGGAGA AACAGGTCCA
ACCGGAAACA CGGGAGTGAC AGGAAGTACG GGTCCAACGG GAGAAACGGG AGTGACAGGA
GGTACGGGTC CAACAGGAGA AACGGGAGTG ACAGGAAGTA CGGGTCCAAC AGGAAGCACC
GGGGTAACCG GAAATACGGG AGCAACCGGA GAAACAGGTC CAACGGGAAG TACGGGTCCA
ACGGGAGAAA CGGGAGTGAC AGGAAATACG GGTCCAACAG GAGAAACGGG AGTAACAGGA
AGCACCGGGG TAACAGGAAA TACGGGTCCA ACAGGAAGCA CCGGAGCAAC GGGAAACACG
GGTCCAACAG GAGAAACGGG AGCAACAGGA AGTACTGGGG TAACAGGAAG TACGGGAGCA
ACAGGAAGTA CTGGGGTAAC AGGAAGTACG GGAGCAACGG GAAATACAGG AGCAACAGGA
AGTACGGGTC CAACAGGAAA TACGGGAGCG ACGGGAAATA CGGGTCCAAC AGGAGAAACG
GGAGCGACAG GAAATACGGG TCCAACAGGA GAAACGGGAG TAACAGGAAG CACCGGGGTA
ACAGGAAATA CGGGTCCAAC AGGAAGCACC GGAGCAACGG GAAACACGGG TCCAACAGGA
GAAACGGGAG CAACAGGAAG TACTGGGGTA ACAGGAAGTA CGGGAGCAAC AGGAAGTACT
GGGGTAACAG GAAGTACAGG AGCAACGGGA AATACAGGAG CAACAGGAAG TACGGGAGCA
ACAGGAAATA CGGGAGCGAC GGGAAGCACC GGAGCAACGG GAAACACGGG TCCAACCGGA
GAAACAGGTC CAACGGGAAG TACGGGAGTA ACAGGAAATA CGGGAGCAAC CGGAAGCACC
GGGGTAACAG GAAACACGGG AGCAACAGGA GCAACAGGAA GCACCGGTCC AACGGGAAGT
ACGGGAGTGA CAGGAAGTAC AGGGGCAACA GGAAGCATCG GAGCAACGGG AGCAACAGGA
AGCACAGGTC CAACAGGAAG CACCGGAGCA ATGGGAGTAA CGGGAAGTAC AGGTCCGACC
GGAAGCACCG GAACAACAGG AAATACGGGA GTAACAGGAG ATACCGGTCC AACAGGAGCG
ACCGGGGTTA GCACAACTGC AACGTACGCG TTTGCGAATA ATACATCAGG AAGTGTTATT
TCTGTTTTGT TAGGTGGCAC GAATATTCCG TTACCAAACA ATCAAAATAT TGGACCGGGA
ATAACTGTTA GTGGTGGGAA TACTGTATTT ACAGTTGCGA ATGCAGGGAA TTATTATATA
GCCTATACAA TTAATTTAAC AGCAGGTTTA CTTGTAAGTT CTCGTATAAC TGTAAATGGC
AGTCCGCTTG CGGGAACGAT AAACTCCCCG ACAGTGGCTA CTGGTTCATT TAGTGCAACA
ATAATTGCTA GCTTGCCTGC TGGAGCTGCC GTTAGCTTAC AACTATTTGG AGTAGTTGCG
TTGGCTACAT TATCTACGGC AACGCCAGGA GCTACTTTAA CGATTATTAG ATTGAGTTAA
 
Protein sequence
MEGNGGKSKI KSPLNSNFKI LSDLVGPTFP PVPTGMTGIT GSTGATGTTG ATGETGPTGS 
TGATGTTGAT GETGPTGSTG ATGTTGATGE TGPTGSTGAT GSTGVTGNTG STGSTGATGT
TGATGETGPT GSTGATGSTG VTGNTGSTGS TGATGTTGAT GETGATGPTG SIGAMGTTGA
TGETGSTGNT GPTGSTGVTG STGPTGETGA TGETGSTGNT GPTGETGATG STGVTGNTGP
TGETGATGST GPTGNTGATG NTGPTGETGA TGSTGVTGNT GATGATGPTG STGATGSTGP
TGETGVTGNT GPTGETGVTG STGVTGNTGP TGETGVTGST GVTGNTGPTG STGATGNTGP
TGETGATGST GVTGSTGATG NTGATGNTGA TGSTGPTGNT GATGNTGPTG NTGATGVTGS
TGVTGNTGPT GATGPTGSTG ATGSTGPTGE TGVTGNTGPT GETGVTGSTG VTGNTGPTGS
TGATGNTGPT GETGATGSTG VTGSTGATGS TGVTGSTGAT GNTGATGNTG ATGSTGPTGN
TGATGNTGPT GSTGATGNTG PTGSTGATGV TGSTGATGNT GPTGETGATG STGVTGSTGA
TGSTGVTGST GATGNTGATG STGPTGNTGA TGSTGPTGST GATGNTGPTG STGETGATGN
TGPTGETGVT GSTGPTGNTG ATGETGVTGP TGSTGVTGST GPTGSTGATG NTGATGETGP
TGNTGVTGST GPTGETGVTG GTGPTGETGV TGSTGPTGST GVTGNTGATG ETGPTGSTGP
TGETGVTGNT GPTGETGVTG STGVTGNTGP TGSTGATGNT GPTGETGATG STGVTGSTGA
TGSTGVTGST GATGNTGATG STGPTGNTGA TGNTGPTGET GATGNTGPTG ETGVTGSTGV
TGNTGPTGST GATGNTGPTG ETGATGSTGV TGSTGATGST GVTGSTGATG NTGATGSTGA
TGNTGATGST GATGNTGPTG ETGPTGSTGV TGNTGATGST GVTGNTGATG ATGSTGPTGS
TGVTGSTGAT GSIGATGATG STGPTGSTGA MGVTGSTGPT GSTGTTGNTG VTGDTGPTGA
TGVSTTATYA FANNTSGSVI SVLLGGTNIP LPNNQNIGPG ITVSGGNTVF TVANAGNYYI
AYTINLTAGL LVSSRITVNG SPLAGTINSP TVATGSFSAT IIASLPAGAA VSLQLFGVVA
LATLSTATPG ATLTIIRLS