Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_3003 |
Symbol | |
ID | 7312435 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | + |
Start bp | 3555006 |
End bp | 3556196 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643609907 |
Product | glycosidase PH1107-related |
Protein accession | YP_002507277 |
Protein GI | 220930368 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.87538 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTTTG ATGAGAGATT GAATTATTTT ATCCAAAAAT ATAATGGATT GATTAAAAGA AAAAACTCCG TTCAAGAAGG CGGAAACGGC ATATACGATA AATACATATA TCCTGTAATA ACCAAGGACC ACACACCTCT TTTCTGGAGA TATGACCTTG ACAGGAATAC TAATCCCTAT CTTATGGAAC GCCTCGGCGT CAATTGTGCT TTCAACCCGG GAGCTATTGA GCTTAACGGG AAAATATACC TTGTAGTACG TATAGAAGGG AACGATAGAA AATCCTTCTT TGCAGTGGCT GAAAGTGAAA GTGGCATTGA TAACTTCACT TTCTGGGATT ACCCTGTGGT AATGCCGGAA ACAAAAAACC CTGATATCAA TATATATGAC ATGAGGCTTG TAAAACATGA GGACGGTTGG ATTTACGGTC TTTTCTGTAC AGAGAGAAAA GACCCGGAGG CTGCCTGCGG CGATACCTTC AGTGCAGTAG CGGCATGCGG CATTGCTAGA ACTAAAGATT TAAAATTATG GGAGCGTTTG CCTGATTTGA AAACAGCATC ACCTCAACAG AGAAATGTGG TCCTTCACCC AGAGTTCATA AAGGGAAAAT ATGCCCTTTA CACACGTCCT CAGGACGGCT TTATCGAAAT AGGAAAGGGC GGCGGAATAG GATTCGGTTA TACCGACAGT ATGGAAAATG CTGTTATATA TGAAGAAATA CTGATGGAAT GCAAGGAATA CCACACTATT AAAGAGGTCA AAAACGGCCA AGGGCCCACC CCTTTGAAGA CTGAAAAAGG CTGGCTTCAC ATTGCCCATG GCGTAAGAAA TACGGCTGCC GGACTAAGAT ATGTTGTATA TGCCTTTCTT TCCGACCTTG AACATCCCGA AATTGTAACC CATCGTCCGG GTGGCTTTTT AATCGCTCCT GAAGGTGAAG AACGAATCGG GGACGTTTCA AATGTGGTCT TCTGTAACGG TGTTGTAGCC AGACAAAACG GAGATGTTCT TATTTACTAT GCTTCCTCCG ATACACGTTG CCATGTAGCT TCTACCACTA TAGACAAGCT TCTGGACTAT GTCATTAATA CCCCGGAGGA CCCTCTAAGA TCCTATGCAT GTGTGCAGCA GAGAATTGAC CTGATTTCAA GAAATTTGAG GCTGATTAAG AATAACACTG TTTCTGTGTA A
|
Protein sequence | MVFDERLNYF IQKYNGLIKR KNSVQEGGNG IYDKYIYPVI TKDHTPLFWR YDLDRNTNPY LMERLGVNCA FNPGAIELNG KIYLVVRIEG NDRKSFFAVA ESESGIDNFT FWDYPVVMPE TKNPDINIYD MRLVKHEDGW IYGLFCTERK DPEAACGDTF SAVAACGIAR TKDLKLWERL PDLKTASPQQ RNVVLHPEFI KGKYALYTRP QDGFIEIGKG GGIGFGYTDS MENAVIYEEI LMECKEYHTI KEVKNGQGPT PLKTEKGWLH IAHGVRNTAA GLRYVVYAFL SDLEHPEIVT HRPGGFLIAP EGEERIGDVS NVVFCNGVVA RQNGDVLIYY ASSDTRCHVA STTIDKLLDY VINTPEDPLR SYACVQQRID LISRNLRLIK NNTVSV
|
| |