Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1890 |
Symbol | |
ID | 7312296 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2245123 |
End bp | 2246688 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643608824 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_002506218 |
Protein GI | 220929309 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.206887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCGACG ATAGAGACGA TAAGATAAAT GAAAATGGTT CCAGTATTGA CCAGCCGCAG AATGATAACG GCACTTATGA AGCAAACAGC AATTCAGAAT CACAGGAATC TGTAAACCAA ACTCAGGAAG ACATCCAGAA CGGTAGTAGT TGGCAATACG AAACAGTAAC TCCTGTAAGC GAAAACAAGG AGCCTGTTGA TAATGTAGTG GGTGAAAATA CTAGTCAAGA TACTGTTACC GATAATTACA ATTTAGATCC ACAAGAATCA CAAACCGAAA CACAAGATGC CAATAAGTCA GGGTTTGGCG AAGATTGCAA TTTGGAAACA CAAGTAGAAC CTAACTCTGT TTACAATAAT TATTACAGGG AAAACTGTAA AAAAACAAAT ACTAAAAAGT CAAATGCATG GAAATACGTA TTGGTATCAG CTGTCAGCTC ACTTGTTGGA GCGGCACTTC TTGCATTGTT AATGCTTTTT GTTGCACCGT TTGTACAGCC CCAAATAAAA TCATACCTTG GAAATAATTT CCCTGGCCTT AAAACTGAGA GTACCCAGCC CAATACCGGC GAACTAAAGA GAGTTGAAAT AGTTCAAAGT GGAGAATCAG CCGTTACAAG TGTAGCAGAG AAAGTCGGGC CTTCAGTTGT AGGTATTAAA ACATCTTATC AGAATACCAA TGAATTGTTT GGAGTACAGT CCGGGGGTGG TGAAGGATCA GGTATAATAA TAAGTGCTGA TGGATATATT CTTACAAACC ATCACGTTAT AGAAGGTGCC TTAAATGATA AAACAAGAAA TATAAGAAGT GATGCAAAAA TTGAAGTGTT CCTTCCCAAC AAAATAGACA AGCCTTACTC TGCTATAGTT AAAGGGTACG ATGCAAAGAC AGATCTGGCT GTATTAAAAA TCAATGATAC AAACCTCCCT GTTATAGAGT TCGGTAATTC GAATGATATA AAGATCGGTG AGCCGGCCAT TGCAGTAGGT AATCCAGGAG GCCTTGAATA CATGGGTTCA GTAACATACG GCGTTATAAG CGGTTTAAAC AGAACTGTTC AGTTGGATGG AGGCAAGAGG ATAAGACTGT TGCAGACAGA TGCTGCCATA AACCCCGGAA ACAGTGGTGG TGCATTGGTA AATATCAAAG GACAGCTTAT TGGTGTAAAT ACAGTAAAGA TGGTTGCAAC AGGGTTTGAA GGACTTGGTT TTGCAATCCC CGTAAATGAG GCAAAAACAA TAGCAGATGA ACTTATCACA AAAACCTACA TTGCAAAACC TTATCTGGGT ATTTCAGTTA ACACGCAGTA TACAGAAGAT ATAGCAAAGG CAAATAATAT GCCTGCCGGA GTATACGTGG CAGATGTTGA ACTTTTTGGT GCAGCTGCTA AAGCGGGCAT AATGCCGGGT GATGTAATAA CCAAATTTAA CAACAAAGTA ATTAAGTCTT ATGATGAGCT TGAAGATACA AAGAATAAGA TGAAACCTGG GGATGTGGTT AAGATAGAGA TTTTCAGAGA CGGAAGTACA AAAACCGTTC AGGCTAAACT AGGTGAAACA AAGTAA
|
Protein sequence | MIDDRDDKIN ENGSSIDQPQ NDNGTYEANS NSESQESVNQ TQEDIQNGSS WQYETVTPVS ENKEPVDNVV GENTSQDTVT DNYNLDPQES QTETQDANKS GFGEDCNLET QVEPNSVYNN YYRENCKKTN TKKSNAWKYV LVSAVSSLVG AALLALLMLF VAPFVQPQIK SYLGNNFPGL KTESTQPNTG ELKRVEIVQS GESAVTSVAE KVGPSVVGIK TSYQNTNELF GVQSGGGEGS GIIISADGYI LTNHHVIEGA LNDKTRNIRS DAKIEVFLPN KIDKPYSAIV KGYDAKTDLA VLKINDTNLP VIEFGNSNDI KIGEPAIAVG NPGGLEYMGS VTYGVISGLN RTVQLDGGKR IRLLQTDAAI NPGNSGGALV NIKGQLIGVN TVKMVATGFE GLGFAIPVNE AKTIADELIT KTYIAKPYLG ISVNTQYTED IAKANNMPAG VYVADVELFG AAAKAGIMPG DVITKFNNKV IKSYDELEDT KNKMKPGDVV KIEIFRDGST KTVQAKLGET K
|
| |