Gene BCAH820_3721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBCAH820_3721 
Symbol 
ID7189851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus AH820 
KingdomBacteria 
Replicon accessionNC_011773 
Strand
Start bp3553180 
End bp3555300 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content50% 
IMG OID643557132 
Productcollagen triple helix repeat protein 
Protein accessionYP_002452671 
Protein GI218904837 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value7.98705e-33 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTCGTT ATGACGACAG TCAAAACAAA TTCTCCAAAC CATGCTTTCC AAGTAGCGCT 
GGACGAATCC CGAATACTCC ATCAATCCCA GTTACTAAGG CACAACTTAG AACATTTCGC
GCAATCATTA TTGATTTAAC AAAAATAATC CCAAAACTTT TCGCAAATCC ATCTCCCCAA
AATATTGAAG ATCTAATCGA TACATTGAAC CTACTAAGTA AATTTATTTG TTCACTAGAC
GCTGCTTCCT CCCTGAAAGC ACAAGGATTA GCTATTATTA AAAACTTAAT AACTATATTA
AAAAACCCAA CTTTCGTAGC AAGTGCTGTA TTTATCGAGC TTCAAAATCT AATTAATTAT
TTACTATCCA TTACAAAACT ATTCCGAATT GACCCTTGCA CACTTCAAGA GCTTCTTAAA
TTAATAGCAG CATTACAAAC CGCTTTAGTT AATTCTGCTT CATTCATTCA AGGACCTACT
GGACCTACTG GGCCAGCTGG TGCTACTGGT GCCACTGGAC CTCAGGGTGC TCAAGGTAAC
ACAGGTGCTA CTGGTGCCAC TGGACCTCAA GGCGCTCAAG GTAACACAGG TGCTACTGGT
GCCACTGGAC CTCAAGGCGC TCAAGGTAAC ACAGGTGCTA CTGGTGCCAC TGGACCTCAA
GGCGCTCAAG GTAACACAGG TGCTACTGGT GCCACTGGAC CTCAAGGCGC TCAAGGTAAC
ACGGGCGCTA CTGGACCTCA GGGTGCTCAA GGTAACACGG GCGCTACTGG ACCTCAGGGT
GTTCAAGGTA ACACGGGCGC TACTGGTGCC ACTGGACCTC AAGGCGCTCA AGGTAACACA
GGTGCTACTG GTGCCACTGG ACCTCAGGGT GCTCAAGGTA ACACAGGCGC TACTGGACCT
CAAGGCGCTC AAGGACCAGC GGGTGCTACT GGTGCTACCG GTGCTACCGG TGCCACTGGA
CCTCAAGGTG TTCAAGGACC AGCAGGTACT ACCGGTGCTA CTGGACCTCA AGGTGCTCAA
GGACCAGCAG GTGCTACTGG CGCTACTGGA CCTCAAGGTG CTACCGGTGC TACCGGACCT
CAAGGTGCTC AAGGACCAGC TGGTGCTACC GGTGCCACTG GACCTCAAGG TGCTCAAGGA
CCAGCTGGTG CTACCGGTGC TACCGGACCT CAAGGCGTTC AAGGACCAGC AGGCGCTACC
GGTGCCACTG GACCTCAAGG TGCTCAAGGA CCAGCAGGTG CTACTGGTGC CACTGGACCT
CAAGGCGTTC AAGGACCAGC AGGTGCTACT GGTGCCACTG GACCTCAAGG CGTTCAAGGA
CCAGCAGGTG CTACCGGTGC CACTGGACCT CAAGGCGTTC AAGGACCAGC AGGTGCTACT
GGTGCCACTG GACCTCAAGG TGTTCAAGGA CCAGCAGGTG CTACTGGCGC TACTGGACCT
CAAGGCGTTC AAGGGCCAAC GGGTGCTACT GGTATAGGAG TTACCGGACC TACTGGGCCT
TCTGGTGGAC CTACTGGACC TACTGGACCT CAGGGACCTC AAGGTAATAC AGGTGCTACT
GGACCTCAAG GTATTCAAGG GCCTGCTGGT GCTACTGGTG CCACTGGACC TCAAGGTGCT
CAAGGACCGG CTGGTGCTAC CGGCGCTACT GGACCTCAAG GTGTTCAAGG GCCAACGGGT
GCTACTGGTA TAGGAGTCAC CGGACCTACT GGGCCTTCTG GACCTAGCTT CCCTGTAGCA
ACAATTGTTG TAACAAACAA CATTCAACAA ACAGTACTCC AATTTAACAA CTTCATTTTT
AATACTGCAA TTAACGTAAA CAACATTATC TTCAACGGCA CAGATACAGT TACTGTTATC
AACGCTGGTA TTTATGTCAT TAGCGTATCC ATCTCTACAA CTGCACCAGG ATGTGCACCA
CTCGGAGTAG GAATTTCAAT AAATGGAGCA GTCGCAACTG ACAACTTCTC TTCAAATCTA
ATAGGCGACT CACTTTCATT CACTACGATC GAAACGTTAA CTGCCGGCGC GAACATTTCT
GTCCAATCCA CTCTTAATGA GATTACGATC CCTGCAACAG GAAACACTAA TATTCGTCTA
ACTGTATTTA GAATCGCTTA A
 
Protein sequence
MSRYDDSQNK FSKPCFPSSA GRIPNTPSIP VTKAQLRTFR AIIIDLTKII PKLFANPSPQ 
NIEDLIDTLN LLSKFICSLD AASSLKAQGL AIIKNLITIL KNPTFVASAV FIELQNLINY
LLSITKLFRI DPCTLQELLK LIAALQTALV NSASFIQGPT GPTGPAGATG ATGPQGAQGN
TGATGATGPQ GAQGNTGATG ATGPQGAQGN TGATGATGPQ GAQGNTGATG ATGPQGAQGN
TGATGPQGAQ GNTGATGPQG VQGNTGATGA TGPQGAQGNT GATGATGPQG AQGNTGATGP
QGAQGPAGAT GATGATGATG PQGVQGPAGT TGATGPQGAQ GPAGATGATG PQGATGATGP
QGAQGPAGAT GATGPQGAQG PAGATGATGP QGVQGPAGAT GATGPQGAQG PAGATGATGP
QGVQGPAGAT GATGPQGVQG PAGATGATGP QGVQGPAGAT GATGPQGVQG PAGATGATGP
QGVQGPTGAT GIGVTGPTGP SGGPTGPTGP QGPQGNTGAT GPQGIQGPAG ATGATGPQGA
QGPAGATGAT GPQGVQGPTG ATGIGVTGPT GPSGPSFPVA TIVVTNNIQQ TVLQFNNFIF
NTAINVNNII FNGTDTVTVI NAGIYVISVS ISTTAPGCAP LGVGISINGA VATDNFSSNL
IGDSLSFTTI ETLTAGANIS VQSTLNEITI PATGNTNIRL TVFRIA