Gene Information |
Locus tag | Ccel_2823 |
Symbol | |
ID | 7311445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 3379270 |
End bp | 3381789 |
Gene Length | 2520 bp |
Protein Length | 839 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 643609718 |
Product | phage minor structural protein |
Protein accession | YP_002507097 |
Protein GI | 220930188 |
COG category | [S] Function unknown |
COG ID | [COG4926] Phage-related protein |
TIGRFAM ID | [TIGR01665] phage minor structural protein, N-terminal region |
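The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(3379270, 3381789)
print(length, protein_length_aa(length))  # 2520 839
```

Under these assumptions the result matches the Gene Length (2520 bp) and Protein Length (839 aa) fields of this record.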
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.4802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATTA AATCAATATT GACATCACAA TCTGACTTTA CGGGTGAGTT TCCTGTAAGT GAGAATACCC TTGCCCTTTG GCGATTCAAT GAGTCTGGAC CTGACTCCGA TGTTAAACTT GTCGATGCGA GTGGTCACGG TCGGCACGTC GCTATATCGG GATGGTCTGG CACATCGGCA AGTCTCCCAA ATGGTCGCTA TGGACGATTT TTTCGTCAGA ATATTATTAA TCCGACATCA GAGAAAACTT ACTTGATTGC AAAGAATGAT GGTACATTTT TCTCCAATCT AGGCGATAAA ATCGCAGTAG GTGGATGGAT AAACCCAACT ACCTACTCTG TGGGACAGAC CTTTATTCCG CTGTTTAATA CAAGACAAGG CCCGGGTCAA CCTATTCTAT ACTTGTCCCT GTACCAGGGC AGACCTCGAA TGATGTTATA TAACTCTTCT GGTTCACTGA TTCTCGACCA GAGTGAAACA CCAAGTTTTA ACATGGTTAA TGGTGGGTGG TACTTTATTG CAGCAATTAT TGAAGTGAAT ACCAAGACAT CGCAGTTTAT TCTTTGTAAC CGAACAGACG GTACAGTCTG GATTGCGCCT AAACGGACAT TCACGGGAAC ATTAAACCCA TCTTGCACAG CCGATATTGT GATTGGAATG CATGCAAATC AGTATTATTA TGCAGGTGGA TTTGATGATT GGTTCATCGA AGTAAATTCA CAGCTTACAA TTGAGGATTT AGAACGACAT TTCAAGCAGT CAATACTTGC CAACGGCGGT GATACCAGTG GTGCGGTTGA TGCTATAACA GAACCTGGTG TGGTAACACT CCTCAAGGAT ACTAACAACA GATATCCTGA AGGTGGGCAG CTTACGACTA TAGCTGCGGA ATGTTCTCTT GCAGGTAGTG GTCGAGTGTC TGTAACTTCG GAATATACTG CAGGTGTGAC TTCAATTTCT TCTATAGAAA CTGCTACATC AGATGATTTG CAGGATTGGT CGGCGTGGCA GGTGGTAGGT TCCAACGGTG AACTAGTATC TCCAAACCGT AATTTTATTC GCTATAGAAT TACACTTTCA ACTACTGATC CCTTGGTTAC CCCGAAACTA TTGGATATCA CACTTCACGA CATACCGAAA GCTCCATATG AGAAACTGGG TTTTGCCAGG CCTGTCGTGC TGGACGAAAA TGGTGCATGG GAAGCGGTAC TTGAAAATGC ATATGACATC ATCGTAACGG GAGAGATTAA CGGCGCAGAT ACTTTGGAAT TCAAACTCCC ATGGAATGAC AGCAAGCGTG TACATTTAGA TAATGAAAAA CAAGTTCAGG TGGCACATGA CATTTATCGC ATAAGGACGC TGACAGATGA AAAAGGAGCA GATGGAACGG GTGTTCTGAC CACCGTATAT GCGGAAGCAG CATTTTATGA TTTGACCTTT TCAGCTGAAA AACAGCCGAG AGAATTTAAT GCCGATTTAC CATCTGCTCC GATGAGTTAT GCGTTGGAAG GTACAGGATG GTCTCTTGGT GTAGTTGATG TCACTACCAA GCGGACATGG CAATGTCAAG AGAAGAATGC ATTGGCCATA CTCCGAATGA CGCAACAGAT TCACGGTGGG GACTTGGTAT TTGATAGTCG AAACCGTCTT GTTAGTTTAC TGAGATTTAG TGGTAGTGAT AGCGGTGCAT TATTTGCCTA TAAGAAAAAC CTTACAAGTA TCAAGAGGGT TGTTGATACC CGCTCTCTGG TGACAAGGCT GTATGCCTAT GGTAAAGATG GCATGACATT TGCCACTATC 
AATGGTGGCA AGGAGTATGT GGATAACTAT GAATATTCCA ATGAGGTACG GGTTTCAACA CTCGATCTTT CGAATTTCAC AAATCCTTAT CAAATGCTTG AGTTTACGAA TATGCGACTT GCAGAGTACT CAAAGCCTCG TGTTTCCTAT GTGCTATCAG CGATGGATTT GTCTGTGCTG ACAGGTTATG AGCATGAAAG GTGGTCACTT GGCGACATTG TGACGGTGGA TGATAGAGAT TTAAATCTTA CTATCAAAAC AAGGGTTGTT CGCAGACAAT ATAATCTTCA GGAGCCGTGG AAAACTGTAC TTGAATTATC ATCAAAACTT CGTGAACTTG GGGATACGTC CTCTGGCATC CTCGCTGATC AGCTTGACCA AAGCAACCTC ATTGGGCAGG AAATCAAAGA TATGGTGCCG TTCAACCACC TGCGTAATAG TAGAGCAGAC GATGGCTTCG CTTACTGGCA GAATTCAGGT TTTGAGGTGG ACACTGAAAA AGGTGTGACC GGTACAGCTT CCTTTAAAGC GGTAGGCTCT GCAACTGCTA CAAAAAGTAT GGCTCAAACA GTATATCCTG CATCCCGCCG TAACTACACC ATATCAGCAC AAATAGGCTC AGATAACCTC CAAAAAGGCC CAGATGGACA AGTGGGTATC GAAGTAGTGT TTGAGTTTGA GGACGGAACT ACTGAAACGA GATTTATTGA CCTATATTAA
|
Protein sequence | MAIKSILTSQ SDFTGEFPVS ENTLALWRFN ESGPDSDVKL VDASGHGRHV AISGWSGTSA SLPNGRYGRF FRQNIINPTS EKTYLIAKND GTFFSNLGDK IAVGGWINPT TYSVGQTFIP LFNTRQGPGQ PILYLSLYQG RPRMMLYNSS GSLILDQSET PSFNMVNGGW YFIAAIIEVN TKTSQFILCN RTDGTVWIAP KRTFTGTLNP SCTADIVIGM HANQYYYAGG FDDWFIEVNS QLTIEDLERH FKQSILANGG DTSGAVDAIT EPGVVTLLKD TNNRYPEGGQ LTTIAAECSL AGSGRVSVTS EYTAGVTSIS SIETATSDDL QDWSAWQVVG SNGELVSPNR NFIRYRITLS TTDPLVTPKL LDITLHDIPK APYEKLGFAR PVVLDENGAW EAVLENAYDI IVTGEINGAD TLEFKLPWND SKRVHLDNEK QVQVAHDIYR IRTLTDEKGA DGTGVLTTVY AEAAFYDLTF SAEKQPREFN ADLPSAPMSY ALEGTGWSLG VVDVTTKRTW QCQEKNALAI LRMTQQIHGG DLVFDSRNRL VSLLRFSGSD SGALFAYKKN LTSIKRVVDT RSLVTRLYAY GKDGMTFATI NGGKEYVDNY EYSNEVRVST LDLSNFTNPY QMLEFTNMRL AEYSKPRVSY VLSAMDLSVL TGYEHERWSL GDIVTVDDRD LNLTIKTRVV RRQYNLQEPW KTVLELSSKL RELGDTSSGI LADQLDQSNL IGQEIKDMVP FNHLRNSRAD DGFAYWQNSG FEVDTEKGVT GTASFKAVGS ATATKSMAQT VYPASRRNYT ISAQIGSDNL QKGPDGQVGI EVVFEFEDGT TETRFIDLY
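The sequences above are displayed in space-separated blocks of ten characters. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and the gene sequence translated for comparison with the protein sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA even though the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares for in-frame codons. All function names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(displayed: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(displayed.split()).upper()


def gc_content(displayed: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(displayed)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(displayed: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean_sequence(displayed)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
first_blocks = "ATGGCAATTA AATCAATATT GACATCACAA"
print(translate(first_blocks))  # MAIKSILTSQ
```

The translated prefix agrees with the first block of the protein sequence. Passing the complete gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked in the same way.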