Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1978 |
Symbol | |
ID | 7310691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2341223 |
End bp | 2342230 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 643608913 |
Product | germination protease |
Protein accession | YP_002506306 |
Protein GI | 220929397 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01441] GPR endopeptidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000089254 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAATT TGTCAAACAG GTATACAGAC CTTGCCATTG AGGCACATGA AATATTTATG AACAGCAGTG AAGCAAAACA GCAGGCTAAG GAGGGAAAAA CACCTCCGGG TGTGGTAATT GAAAATGCCG GAGACGAGGA TGTAAAGATT ACAAGAGTCC ATATTACATC CCGTACAGGT GAGGCTTCCA TCGGAAAGCC CATGGGAAAC TATATAACTC TTGAGGTTCC GGAGCTGAAG GATAATAATG AGGATGTAAA CAAAAAGACT ATTGATGCAA TGGCAAAGGA ATTGAGGGAT ATCCTGAAAC TGAAAGAAGG CTCTACAACA CTTGTCATAG GATTAGGAAA CTGGAATGTA ACTCCTGATG CGTTAGGGCC GAAGGTAGTG TCAAACCTCA TGGTAACAAG GCATCTGCTT CAATATGTCC CCGAGCATGT TGATGAGGGG GTGAGTCCTG TTTGTGCTAT TTCTCCGGGA GTACTAGGAA TAACTGGAAT AGAGACCGGG GAAATTGTAA AAGGAGTTGT TGACAGGCTT AAACCTGATG CATTGATTGC AATAGATGCT CTTGCTGCAA GGAGCATGGA AAGGGTTAGT ACTACTATTC AGATTGCAGA TACAGGGATT GCACCGGGAG GGGGAGTCGG AAACAAGAGA ATGGAGCTAA GCGAAAAGAC TCTGGGGATG CCCGTAGTTG CTATAGGCGT GCCGACTGTG GTTGATGCGG CTACTTTGAC AAATGATGCC ATGGATCTTG TAATTGATAG CCTGATGAAT GAATCACCAA AGGATTCCGG TTTTTATAAT ACCCTGAAGG ATATTGACAG AGATCAGAAG TATGAACTTA TTAAGGAAGC CTTAAATCCA TTTGTAGGTC ATCTTATTGT TACACCTAAG GAAATAGATG ATATAATCAG CCGTGTTTCA AAAGTGGTAG CCAACGGGCT CAATATGGCT CTACATCAAG GAATCACACT TGGTGACGTC GGCAGATATA TCAACTGA
|
Protein sequence | MINLSNRYTD LAIEAHEIFM NSSEAKQQAK EGKTPPGVVI ENAGDEDVKI TRVHITSRTG EASIGKPMGN YITLEVPELK DNNEDVNKKT IDAMAKELRD ILKLKEGSTT LVIGLGNWNV TPDALGPKVV SNLMVTRHLL QYVPEHVDEG VSPVCAISPG VLGITGIETG EIVKGVVDRL KPDALIAIDA LAARSMERVS TTIQIADTGI APGGGVGNKR MELSEKTLGM PVVAIGVPTV VDAATLTNDA MDLVIDSLMN ESPKDSGFYN TLKDIDRDQK YELIKEALNP FVGHLIVTPK EIDDIISRVS KVVANGLNMA LHQGITLGDV GRYIN
|
| |