Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcer98_2389 |
Symbol | |
ID | 5343518 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | + |
Start bp | 2475073 |
End bp | 2477601 |
Gene Length | 2529 bp |
Protein Length | 842 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640839896 |
Product | triple helix repeat-containing collagen |
Protein accession | YP_001375622 |
Protein GI | 152976105 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00549275 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTATT TTGATAGTCA TGATAAACTA CCTAAACCGT GTTTTCCTAA TAATTCTGGA CGCATCCCAA TCTTAAGCTC CATCCCGATT TCTAAATCCC AACTTCGAAC ATTCCGAGCA ATTATCTCCC ATTTAACAAA AACCATCCCT AACCTATTCC TTAATCCTTC TCCATCGACC ATTGAAGATT TTATAGATAC ATTATGCTTA TTAAAAAAGT TTATTAAATG TTTAGAGACT TCTTCTTCTC AAAAAGCAAT CGGACTTGCG ATTATAAAAA ATTTAATAAC CATATTAGAA AATCCTACAT TTGTTCCAGG CGCCGTATTT ATTGAACTTC AAAACTTGGT CAACTATTTA CTATACATCA CTAAATTATT CCGATTAGAT GATTGTATAC TTGAATGCAT CATTGATCAA ATTGAAGAAT TACAACTTAT ACTCATTGAA TTTGCACCAT TTGGAACAAT TGGGCCTACG GGCGCTACTG GACCTACCGG ACCTCAAGGG CCTCAAGGCG TTCAAGGACC TACAGGGGCT ACTGGACCTC AAGGGCCTCA AGGCGTTCAA GGACCTACCG GCGCTACTGG ACCTCAAGGG CCTCAAGGCG TTCAAGGGCC TACCGGCGCT ACTGGACCTC AAGGGCCTCA GGGCATCCAA GGACCTACCG GCGCTACTGG ACCTCAAGGA CCTCAAGGCA TCCAAGGACC TACCGGCGCT ACTGGACCTC AAGGACCTCA AGGTATCCAA GGGCCTACCG GCGCTACCGG ACCTCAAGGG CCTCAAGGCG TTCAAGGACC TACAGGGGCT ACCGGACCTC AAGGGCCTCA AGGCGTTCAA GGACCTACAG GGGCTACCGG ACCTCAAGGG CCTCAGGGCA TCCAAGGACC TACAGGTGCT ACTGGACCTC AAGGGCCTCA GGGCATCCAA GGACCTACCG GCGCTACTGG ACCTCAAGGA CCTCAAGGCG TTCAAGGACC TACCGGCGCT ACTGGACCTC AAGGGCCTCA AGGCGTTCAA GGGCCTACAG GGGCCACTGG ACCTCAAGGG CCTCAGGGCA TCCAAGGACC TACCGGCGCT ACTGGACCTA CCGGACCTCA AGGCATCCAA GGACCTCAAG GCATCCAAGG ACCTACCGGC GCTACTGGAC CTCAAGGACC TCAAGGTATC CAAGGGCCTA CCGGCGCTAC CGGACCTCAA GGGCCTCAAG GCGTTCAAGG ACCTACAGGG GCTACCGGAC CTCAAGGGCC TCAAGGCGTT CAAGGACCTA CAGGGGCTAC CGGACCTCAA GGGCCTCAAG GCGTTCAAGG ACCTACAGGG GCCACTGGAC CTCAAGGGCC TCAAGGCGTT CAAGGACCTA CCGGCGCTAC TGGACCTCAA GGACCTCAAG GCGTTCAAGG ACCTACCGGC GCTACTGGAC CTCAAGGGCC TCAAGGCATC CAAGGACCTA CAGGGGCCAC TGGACCTCAA GGGCCTCAAG GCATCCAAGG ACCTACAGGG GCTACCGGAC CTCAAGGGCC TCAAGGTATC CAAGGACCTA CCGGCGCTAC TGGACCTCAA GGGCCTCAGG GCATCCAAGG ACCTACAGGG GCCACTGGAC CTCAAGGGCC TCAAGGCATC CAAGGACCTA CAGGGGCCAC TGGACCTCAA GGGCCTCAAG GCATCCAAGG ACCTACAGGG GCCACTGGAC CTCAAGGGCC TCAAGGCATC CAAGGACCTA CAGGGGCCAC TGGACCTCAA GGGCCTCAAG GCATCCAAGG ACCTACAGGG GCCACTGGAC CTCAAGGGCC TCAAGGCATC CAAGGACCTA CAGGGGCCAC TGGACCTCAA GGGCCTACTG GCGCTACTGG ACCTCAAGGG CCTCAAGGTA TCCAAGGGCC TACAGGGGCT ACCGGACCTC AAGGTATCCA AGGACCTACA GGGGCTACTG GACCTCAAGG ACCTCAAGGC GTTCAAGGAC CTACCGGCGC TACCGGACCT CAAGGGCCTC AAGGCATCCA AGGACCTACA GGGGCTACTG GACCTCAAGG GCCTCAAGGC ATCCAAGGAC CTACAGGGGC TACCGGGCCT CAAGGTATCC AAGGACCTAC CGGCGCTACT GGACCTCAAG GGCCTACTGG GGCAACAGGA GCTAGCTTTC CAGTCGCAAC AGCCGTTCTA CAAAATTCTA ACTCTCAAAC TGTAGATTTA GGGGAAAATT TCGTCTTTAG CACAAGTTCT AACCTTAGAA ATATTAATTT CAATGGGACT GACACACTTA CTATTCTTGA AGATGGCGTT TATGTTATCA GTTTTTCAAT TTCTATCACT GCACCTGCTT GCGCACCATT CGGAGTAGGT ATTTCGCAAA ACGGAGCTGT ACCAAGTGAT AATTTCTCTG GTAATGTAAT TGGTTCTTCT CTTTCTTTCA CAACAATTGA AACATTGACA GCCGGCACAA ATATCACTGT TCAATCCACT CTAGGTGAAA TCTCAATTCC TGCTACAGGC GATACTAACA TCCGACTTAC TATATTTAGA ATCGCTTAA
|
Protein sequence | MSYFDSHDKL PKPCFPNNSG RIPILSSIPI SKSQLRTFRA IISHLTKTIP NLFLNPSPST IEDFIDTLCL LKKFIKCLET SSSQKAIGLA IIKNLITILE NPTFVPGAVF IELQNLVNYL LYITKLFRLD DCILECIIDQ IEELQLILIE FAPFGTIGPT GATGPTGPQG PQGVQGPTGA TGPQGPQGVQ GPTGATGPQG PQGVQGPTGA TGPQGPQGIQ GPTGATGPQG PQGIQGPTGA TGPQGPQGIQ GPTGATGPQG PQGVQGPTGA TGPQGPQGVQ GPTGATGPQG PQGIQGPTGA TGPQGPQGIQ GPTGATGPQG PQGVQGPTGA TGPQGPQGVQ GPTGATGPQG PQGIQGPTGA TGPTGPQGIQ GPQGIQGPTG ATGPQGPQGI QGPTGATGPQ GPQGVQGPTG ATGPQGPQGV QGPTGATGPQ GPQGVQGPTG ATGPQGPQGV QGPTGATGPQ GPQGVQGPTG ATGPQGPQGI QGPTGATGPQ GPQGIQGPTG ATGPQGPQGI QGPTGATGPQ GPQGIQGPTG ATGPQGPQGI QGPTGATGPQ GPQGIQGPTG ATGPQGPQGI QGPTGATGPQ GPQGIQGPTG ATGPQGPQGI QGPTGATGPQ GPTGATGPQG PQGIQGPTGA TGPQGIQGPT GATGPQGPQG VQGPTGATGP QGPQGIQGPT GATGPQGPQG IQGPTGATGP QGIQGPTGAT GPQGPTGATG ASFPVATAVL QNSNSQTVDL GENFVFSTSS NLRNINFNGT DTLTILEDGV YVISFSISIT APACAPFGVG ISQNGAVPSD NFSGNVIGSS LSFTTIETLT AGTNITVQST LGEISIPATG DTNIRLTIFR IA
|
| |