Gene Ccel_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1960 
Symbol 
ID7310675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2319623 
End bp2321836 
Gene Length2214 bp 
Protein Length737 aa 
Translation table11 
GC content39% 
IMG OID643608894 
ProductYhgE/Pip C-terminal domain protein 
Protein accessionYP_002506288 
Protein GI220929379 
COG category[S] Function unknown 
COG ID[COG1511] Predicted membrane protein 
TIGRFAM ID[TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats
[TIGR03061] YhgE/Pip N-terminal domain
[TIGR03062] YhgE/Pip C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.779213 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACAT TGAAAAAGTA TAAGAAATTT GCCGTAATCG TGGCAGTTAT ACTAATTCCA 
TTGGTCTACA GTTTCTTCTA TCTTGATGCT TTTTGGGATC CATACAGCAA ACTTGACAAA
CTTCCTGTAG CCGTGGTAAA CCAGGATAAC GGAGCAACTA TCGACGGCGA AAACAGAAAT
CTGGGAAAAG AAATTACAGA CAATCTAAAA ACCGATAAAA ATCTTAAATG GGTTATTACG
TCGGAATCTG ATGCAAAGGA CGGTTTAGAG AACAGAAGGT ATTACGCAAT GATTAGTATA
CCTGGAGATT TCTCAAAAAA TATTTCTTCA GCAGCAGACA TTGACAAGAC TCAGGGTAAT
CTGATATATA CCGTTAATGA AAAGGGAAAT TACCTGGCAA GTCAGGTGCT GAGCAGGGTC
ACATTAGAGT TTAAGGATAA AATTTCCAAA TCTGTTTCTG AAGAAATTGT AGGAACCCTT
TTAGATCAGA TAAAGGACCT TCCTACTAGT CTAAAAGAAC TTGATGATGG CTTAAAGGAA
ATAAAAGACG GAGCAGAACT ACTCTATGAC AGCAATGGTA AAATTGTAGA CGGACAGAAG
AATTTTAATG ACGGTGTTAA TAAGCTGAAC AACGGACTTG CAGATGCCAA TAATGGTTCA
AAGACTCTTA TACAAGGTTC AAAACAGCTT AGCGACGGTG CAGAACTATT CTACAGAAAT
TTATCAGGCG GCTCAGGTAA AATGACAGCT CTGGTTAACG GTTCAAATAC TTTTATGTCA
GGTCTGTCAA ATCTGAATTC GGGTTTAAAC CAGTTGAATT CCAGCATTAC AGAAGCAGCT
CCGCAAATAT CTCAGTTGTC TAAAGGAACC TTGGACTTAA ACAGCGGAGT ACAGTCCTAT
ACATCAGGTG TTGACAAGTA TATTGAATCG GTAAACAAGG TTTCTCAGAC TCAATCTGCT
TTGGCAAACT CTATTCAAAA ATACGTGGCA AGTCATCCTG AAGCATTGAC AGACCCAAAT
TTCAAGGCTG TAATCGCTAC TCTGGAGGCC TCAAAGTCTG TTCCTGAACA GCTTAAAACC
GCCGGAGGGC AGCTTTCTTC CGCCGGAAAA CTACTTACTG ACGGTTCAGG CAAGGTTGCA
GGTGGAGTCT CACAGTTGAC TACCCAGTTG AGTTCAGCAG CACAAGGTAT AAATAAACTT
GCTGCAGGAT CAAATGAATT GAACAAATCA TACCCCATGA TTAACCAAGG TATCCTTGAT
ACTGCCTCCA GCATCAAAAC TGCATCCGAT AAATCCAAGG AACTTTCTTC AGGGGCTTCA
TCGGTTAATG ACGGAGTTGC AAAACTTTCA AGCGGTATTT CAGAACTGGC AGCCGGCAGC
GAGGAATTAT CCAAGAATTC TGGAGTGTTA CTTGACGGTG AAACAAAGAT TCAGGACAGT
TTAGGCAAAC TTAAGGATGG GGTAACAGAA GCAAGCAGCG GTGTGTCCTC TTCCCTTCTA
AAGGCTGATG GTAAATTAAA CGGTACAGAA GGCCTAAAGG AATATGCAGC AGATCCTGTT
AAAATAACAG AAAAGAAGGT TTACGGTATT CCTGACTATG GTACTGCGTT TACACCTTAT
TTTGTATCAC TATCACTTTG GGTTGGTGCA TTGCTGATGT TCTTTGCAAT TTATCTGGAT
GAGGAAGTAA GGTTTCGCAA ATTCTCCTCC AAGTCTAAAG GTATTATGAG ATTTTTTGCA
TATACTTTAA TAGGTATTGC CCAGGCTCTT GTATTAGACT TTGTCATAGT AAAAGGTTTA
CATTTGGAAG TTGCAAATAT GGGGCTTTTT GTGCTGACTA GTATAATAAT ATCATTGTCA
TTTACATCAA TTATGAGATT TTTACTAGTA CAGTTAAGAG ATGTAGGCAA GTTCCTTGCA
ATTCTGCTAT TGATATTACA GCTTACTTCA TGTGGAGGAA CCTTCCCTAT GGAACTTGTT
CCACGATTCT TCAATGTACT TAATCCGTTT ATGCCAATGA CATATTCGGT TAATGCGTTA
AGAGAAGTAA TTGCAGGTAT CAATAATGGA TTCCTTGCAC AGAACCTTAT TGTTTTAGTT
ACCGTAATGA TAGGATTTCT TATATTGAAC CTTGTAGTAT CAAAGCTTAG ATTTGGAAGC
ATTTCTTCTG ATTCTGATGA TTTTGTTAAA ATATCCGAAG AAGTTTCAGC ATAA
 
Protein sequence
MQTLKKYKKF AVIVAVILIP LVYSFFYLDA FWDPYSKLDK LPVAVVNQDN GATIDGENRN 
LGKEITDNLK TDKNLKWVIT SESDAKDGLE NRRYYAMISI PGDFSKNISS AADIDKTQGN
LIYTVNEKGN YLASQVLSRV TLEFKDKISK SVSEEIVGTL LDQIKDLPTS LKELDDGLKE
IKDGAELLYD SNGKIVDGQK NFNDGVNKLN NGLADANNGS KTLIQGSKQL SDGAELFYRN
LSGGSGKMTA LVNGSNTFMS GLSNLNSGLN QLNSSITEAA PQISQLSKGT LDLNSGVQSY
TSGVDKYIES VNKVSQTQSA LANSIQKYVA SHPEALTDPN FKAVIATLEA SKSVPEQLKT
AGGQLSSAGK LLTDGSGKVA GGVSQLTTQL SSAAQGINKL AAGSNELNKS YPMINQGILD
TASSIKTASD KSKELSSGAS SVNDGVAKLS SGISELAAGS EELSKNSGVL LDGETKIQDS
LGKLKDGVTE ASSGVSSSLL KADGKLNGTE GLKEYAADPV KITEKKVYGI PDYGTAFTPY
FVSLSLWVGA LLMFFAIYLD EEVRFRKFSS KSKGIMRFFA YTLIGIAQAL VLDFVIVKGL
HLEVANMGLF VLTSIIISLS FTSIMRFLLV QLRDVGKFLA ILLLILQLTS CGGTFPMELV
PRFFNVLNPF MPMTYSVNAL REVIAGINNG FLAQNLIVLV TVMIGFLILN LVVSKLRFGS
ISSDSDDFVK ISEEVSA