Gene Ccel_3035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_3035 
Symbol 
ID7311640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp3585089 
End bp3587614 
Gene Length2526 bp 
Protein Length841 aa 
Translation table11 
GC content42% 
IMG OID643609937 
ProductPhage-related protein-like protein 
Protein accessionYP_002507307 
Protein GI220930398 
COG category[S] Function unknown 
COG ID[COG5412] Phage-related protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.716589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAATG GAAGTGTAGG CCAAATAGCT TTAGACCTTG GAGTAAACTA TGACGGCTTT 
AACAAGCAAT TAAGCGGCAT TGCCGGTAAC GCTACAAACA TGGTGGGTGC ATCGTTTAAA
AAGCTTGGTG GAATAGTCGC TGCAGCCTTT GCTGTAGACA AACTTATTGA TTTTGGTAGA
CAGTCCATGG AGCTTGCATC CAACCTTACA GAGGTGCAAA ACGTTGTAGA TGTTACGTTC
GGATCCATGG CAGCAGATAT AAATGCCTGG TCTAAAAATA TGTTGTCTGG ATTTGGCCTA
TCGGAATTGT CTGCAAAGAA ATATTCTTCA ACTCTTGGTG CCATGATGAA AAGTTCCGGG
TTGGTCGGCA CACAAATGGA GGGGATGTCA AAGAAGCTGA CGGAATTATC AGCTGATATG
GCATCCTTTT ATAATTTGTC CAATGATGAG GCGTTCGAGA AAATCCGCTC TGGAATAAGT
GGTGAAACAG AGCCATTAAA GCAATTAGGC GTTAATATGT CGGTTGCAAA CATGGAAGCT
TATGCCTTAA CTCAAGGGAT AAAGAAACAG TATTCGGAAA TGAGCCAAGC CCAACAGACT
TTACTCCGGT ATAACTATCT GCTTTCGGTT ACAAAGGATG CCCAGGGTGA CTTTGCAAGG
ACATCTGGTT CATGGGCAAA TCAGATAAAA CTGCTTGGAG AGCAGTGGAA TATATTTAAA
GGCAGCATGG GTGCAGGTTT TATTAATATC CTTGCACCTA TAGTCCGGGG ACTTAACTTC
CTTATTTCAA AATTGCAGAT TGCAGCTCAG TACTTTAAAG CTTTTACCGC TCTATTATTT
GGGGATGCTT CAGGAGGTTC AGTAAGCGGT GCGGCAAACA ATGCGGCATC AGCTACATCT
GATATGGGAA CAGCCGCCGG AGGAGCAGGG AAAGAGGTCA AAAAAGCTGG GAAAGATGTT
AAAGGAGCAT TGGGTGGCTT TGACCAGTTA AACACATTAG CCATGTCTGC GGCTGATTCA
ATGGACGATG CTGGCTCTGC CGCCGGAGGT CTTGCAGATA TGGGCGGCAT GGGGGATATG
AATCTTGGTG GGGGCAATAT AGACATTGGG ATTGACCCTA GTAAGCTCAA ACCCCTACAA
GACATACTTA ATAATATTAA GTCAATTGCA AGTCAGGTGG CTGGATATTT TGTTGCGAAC
TTTGGCCCTC CAATAGCTCA AGCTATTTCT GCTATATCGG TACCGTTACA AGGCTGGAAA
ACTGCTATTG CTGATGCTTT TTCCCAATTT CAAACACTTG GAGAGCCTTT GAAACAATGG
TTTGTAGGAG ATTTTACAAC ACAGATACAG ACGGGTATTA AAGTAACAGG CAATGTCTTA
GCGGGGTTGC TTGATACTGG GCTAAAAGTG TTCAACACTA TAAAAGAAGT AGCTTTTCCG
ATTGTATCAT GGTTTGTAAC CGACGGATTA CCAATGATAT CTCAGTTTAC AACACAGTGG
GGTAATCTGT TTCAGAATAT GTTCGACAGT ACAAAACAGA TCTTCGATAT GTTATGGGAT
CAGGGAGTCG CTCCGGGCCT GAGAGTTGTA TCAGGGATGA TACTCGATAC GTTGAACATT
ATAAAAGGCT TCTGGGATGA CTGGGGAGGC AAAATATTTG AAGGGCTTAA CGGACTGGTT
GGTTCTATAA AAAGTGTAAT GACAAATCTG TGGCAGAACT TTTTACAACC AATTTGGCAA
AACATTTGCG AAACTATCAG CTGGCTATGG AGCAAGCATT TAAAGGGACT GATAAAAGAG
ATTACAGACT TCGTTGGAAA ATTGGTTACC GCCGTACTTG ATATCACCAA CAAGTTTATT
ACCCCACTGG TAAATATGCT GATAAAAGTT CTCGGCCCTA CTTGGTCTAA TATTTTCAAC
GGTATTGTTA ATGTAATCGG TACGGTGGTC GGAACTATTG TTGATGTCAT TAAGGGGTTG
ATAAAATCTC TTGGTGGTCT TGTAGACTTT ATTGCAGGAG TATTCACTGG TGACTGGAAG
AGAGCCTGGA CAGGTATAAA AGATTTCTTT AAAGGAATAT TCGACAGTAT TGTTGGCATT
TTCAAGGGGG CCATAAACCT GATTGTTGAT GCATTTAACT TCATGATTGG AGCTTTAAAT
AATATTCAGA TAAAAATCCC TGATTGGTCA CCTATAGGTG GGGGCAAATC CTTTGGCATA
AATATACCCA AAATCCCTAA GCTTGCAAAC GGGGGGTTGG TATCAGCTCC AACACTGGCC
ATGGTCGGAG ATAACCGTAA CGCTCAAGCA GACCCAGAGG TAGTTAGTCC TTTGTCAAAG
CTGCAGGGTA TGTTAAACGG AGGTAATCAG GAAGTTGTTG CTGCTATAAA TGAGCTAATG
GAACTTATCC GAAACTTGCA AACACCTGTA ATTATGAAAG TAGGGGAAAC TGAATTCGGC
AAGGCCGTGA TAAGAACTGC AAATGTAGCC AATAGGCAAT CAGGCTATAC CCTATTCGAG
GTATAA
 
Protein sequence
MSNGSVGQIA LDLGVNYDGF NKQLSGIAGN ATNMVGASFK KLGGIVAAAF AVDKLIDFGR 
QSMELASNLT EVQNVVDVTF GSMAADINAW SKNMLSGFGL SELSAKKYSS TLGAMMKSSG
LVGTQMEGMS KKLTELSADM ASFYNLSNDE AFEKIRSGIS GETEPLKQLG VNMSVANMEA
YALTQGIKKQ YSEMSQAQQT LLRYNYLLSV TKDAQGDFAR TSGSWANQIK LLGEQWNIFK
GSMGAGFINI LAPIVRGLNF LISKLQIAAQ YFKAFTALLF GDASGGSVSG AANNAASATS
DMGTAAGGAG KEVKKAGKDV KGALGGFDQL NTLAMSAADS MDDAGSAAGG LADMGGMGDM
NLGGGNIDIG IDPSKLKPLQ DILNNIKSIA SQVAGYFVAN FGPPIAQAIS AISVPLQGWK
TAIADAFSQF QTLGEPLKQW FVGDFTTQIQ TGIKVTGNVL AGLLDTGLKV FNTIKEVAFP
IVSWFVTDGL PMISQFTTQW GNLFQNMFDS TKQIFDMLWD QGVAPGLRVV SGMILDTLNI
IKGFWDDWGG KIFEGLNGLV GSIKSVMTNL WQNFLQPIWQ NICETISWLW SKHLKGLIKE
ITDFVGKLVT AVLDITNKFI TPLVNMLIKV LGPTWSNIFN GIVNVIGTVV GTIVDVIKGL
IKSLGGLVDF IAGVFTGDWK RAWTGIKDFF KGIFDSIVGI FKGAINLIVD AFNFMIGALN
NIQIKIPDWS PIGGGKSFGI NIPKIPKLAN GGLVSAPTLA MVGDNRNAQA DPEVVSPLSK
LQGMLNGGNQ EVVAAINELM ELIRNLQTPV IMKVGETEFG KAVIRTANVA NRQSGYTLFE
V