Gene Information |
Locus tag | Ccel_1860 |
Symbol | |
ID | 7310583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2212905 |
End bp | 2214464 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 643608791 |
Product | Rhomboid family protein |
Protein accession | YP_002506188 |
Protein GI | 220929279 |
COG category | [R] General function prediction only |
COG ID | [COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) |
TIGRFAM ID | |
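The coordinate and length fields above are linked by simple arithmetic. The following is a minimal sketch of that relationship, assuming 1-based inclusive coordinates (which the reported 1560 bp implies) and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2212905, 2214464)
print(length, protein_length(length))
```

For Ccel_1860 this gives 1560 bp and 519 aa, matching the Gene Length and Protein Length fields.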
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000892671 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGTA ATTTTTTAAA TTCCTTGGTA AGATATCTTG TGGAAAAGGA CTATTACCAC CTTATATCTG CTGATAATCA GATACCTGAT TTTTCTAATG GTATTGCAAG TTTAATTAAA GAGATACAGG GAACTTCTGT ATTTGTTGAA ATTATTGACG CAGACAGATA CGGAACAGAA CAAATCAGGA ATATTATGTT AAACGGTGCA GCAATGCTGA ATAATATTCA AGGCAATAAC GCTTATATTT TCAAGGTTTT TTTGTTTGAT AGTACTCCTG ATATGGATAA AGTTGAAATT ATAAAGCAGC ATCAGATGGA TATTACTTCG GAAAAACGCT TTATGAAATG TATTTCTGTA AACATTTCTG CAAAACAGGC TGAGAAATAT TTCAGTGTTC CTGCTTTCGA TGCCGGATTG GTAAAGTCTT TTAAAAGATT TTTTTCTAAG GGACTTGATA AAAGAGAAAC CAGTTATAAG GATATTGAAG ACGTTATTGA GAAAAGAAAA AAAGACTTTG AAATACAGTC CAAGGCTGAA ACGCCATGGC TGACATACAT TATTATTGCT TTTAATATTG TTATGTGGGG CTTGTTGCAG CTTGTGTCCA TGAGAACCGG AACCGCTTAT CAACAACAGC TGGAACCCTT TGGAGCAAAG GTAAATAATC TCATTATGGA AGGGCAGTAC TGGAGATTTA TATCACCTAT GTTCCTGCAT GGAGATATTG TCCACCTGGC TGTAAACTGC TATTCGCTGT ATATCATAGG GTCTCAGGTG GAGAAAATAT TCGGACGAGG AAGGTTCTTG GCTATTTACT TTGTGTCGGG TTTCATTGGC TCAGCAGCAA GTTTTGCATT CTCACTGAAT TCCTCTGTAG GGGCATCAGG TGCTATATTT GGTTTAGTAG GTGCTATGCT CTATTTTTCG TTAAGACGTC CTGCACTTTT AAAAAGCAGC TATGGTGTAA ATCTTATTAC TATGCTTATA ATAAACCTTG CTTATGGTTT TATGAACAAG AGGATAGACA ACCATGCACA CATAGGTGGT TTTGTAGGAG GATTCTTGAC TGCTGGGGCT GTATATTCCT ACAGGGAAAT AAATGGAAAA AACATATTGA AAAAAGTAAC ATCTATTTTA CTTGTGGCAG CGATTACAAT GGGAATGTTA TTTTATGGCT TTAATAATGA TATAAATGTT CTTTCTCCTA AGCTTGCTGC ACTGGAGCAG TCCGATATTC AGAATAACTG GCAGGAGTCT GAAAAAAAAG CAGAGGAAAT TCTTGACTTG AACCCTTCCG ACAAAAATAC AAAGATTAGG GTATTATGGT CATTAATCAG GGCTGAAATT GGTCAAGGAA AGCTGGATGA AGGTATTCAA AATTCAATGG CCTTGGCAGA ATTGAGTCCG GCTGACGGAC ATTACCTGCT CGGAGTCATA TACTATAATA CGAAAGAATT TGGTAAAGCT AAGCAGGAGC TTGAGCAAGC AAAAAAATCA GGGTCTCCCA ATATTGATAA TATTAATGAA ATGCTTTCCG GTATTGAAAA CAGTAAATAA
|
Protein sequence | MKSNFLNSLV RYLVEKDYYH LISADNQIPD FSNGIASLIK EIQGTSVFVE IIDADRYGTE QIRNIMLNGA AMLNNIQGNN AYIFKVFLFD STPDMDKVEI IKQHQMDITS EKRFMKCISV NISAKQAEKY FSVPAFDAGL VKSFKRFFSK GLDKRETSYK DIEDVIEKRK KDFEIQSKAE TPWLTYIIIA FNIVMWGLLQ LVSMRTGTAY QQQLEPFGAK VNNLIMEGQY WRFISPMFLH GDIVHLAVNC YSLYIIGSQV EKIFGRGRFL AIYFVSGFIG SAASFAFSLN SSVGASGAIF GLVGAMLYFS LRRPALLKSS YGVNLITMLI INLAYGFMNK RIDNHAHIGG FVGGFLTAGA VYSYREINGK NILKKVTSIL LVAAITMGML FYGFNNDINV LSPKLAALEQ SDIQNNWQES EKKAEEILDL NPSDKNTKIR VLWSLIRAEI GQGKLDEGIQ NSMALAELSP ADGHYLLGVI YYNTKEFGKA KQELEQAKKS GSPNIDNINE MLSGIENSK
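The sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that grouping, compute GC content, and translate the gene sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAA) and uses the standard codon assignments, which translation table 11 shares with the standard code for elongation. Table 11's alternative start codons are not handled here, which does not matter for this gene because it starts with ATG. All helper names are illustrative.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
prefix = "ATGAAAAGTA ATTTTTTAAA TTCCTTGGTA"
print(translate(prefix))
```

The first three blocks translate to MKSNFLNSLV, which is the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.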