Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ccel_1768 |
Symbol | |
ID | 7310502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 2115803 |
End bp | 2117542 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643608699 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002506099 |
Protein GI | 220929190 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000449992 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA AATTAGCAAT ATTGTTAATT GTATTGTCAA TTATACTGAC GGTACCTGCA TGCTCAAGTA AATCTGATTC TGGAGATACT TCCGGCGGCT CATCAACAGG AACAAAGGTC AGTCAGACAG GATTTGATTA CAAGAAGTAC GGAGTTGAAT ATACTGCTTC TACTGACACC GCAAAAAGTC CCAAGGTTGC AACAGACAGA AAGGACACAT TAGTTGTTGG ATTGCCGGAT ACAACAGGAA TATTTAATTA TTTGTACGGC GATAATGCTT ATGATTGGTT TGCGATCTAT ACCATGTTTG ATTTTAATAT AGACGTTGAT TTCGATGGTA AGGCGATACC CGGTGCCACT GACTATACAA TTTCGGAGGA TGGGCTGACA TATACTTTTA AAATAAAAGA CGGAGTTAAG TTCTGGGATG GAAACCCTGC TACAGCCTCT GATTTGGAGT TTGCATACTA TCTGGAAGCT GATCCCAAAT ATGATGGACC TTCGGATATA TCAAAAACAT TCATAAAAGG TCTTGATCCA TATAAAAATG GAAATGCTGA CAAAATCGAA GGAATAAAGG TGCTTGATGA TAAAACATTG CAGATAACCG TTGATAAAGC CAGCGGCCCT GCAATATATG CGTTGCAGGT TCCGTTACTT GAAAAGAAGT ATTACGGTGC TGATTTCAAA AAGGGTGATA CTGCAAAGGT AAAAGAAAAA AACGGAGCAC CCATGGGCAC AGGTCAATAC AAATTTGTTG AGTATAAAGC GGGTCAGGAG CTGAAACTTG TAGCCAACGA GAATTATTTC AAGGGAGCTC CTAAAATTAA AAATCTGATA TTTTCAGTGA CACCGACAGG GCAGGAGCTT CAAAGGGTTA TGGCAGGAGA GACAGATATT GATATGGCTG ATGTTTCACC TGATAATATG AAAGCAGCAA AGGATGCAGG GTTTATAGAC ATATACAGAT TTGCTACAAA CGGTTACGGA TTTGTGGGAT TAAACGATGC TGATCCTAAA TTCAGTGATG TGAAAGTACG GCAGGCTCTT ATGTATGCTC TTAACAGAGC TGCTGTTGTA GAAAAGGTAT ACGGTGAATA TGCAAGAGTC GTGAACATAC CTGAATCAAA TGTATCATGG GCATACGACG ATGAAGGGTG CAATACATAT GAATACAATC TTGATAAAGC AGGACAGCTG CTGGATGAAG CGGGTTGGAA GCTAAACAGC AACGGAAAAC GTGAAAAGGA CGGCAAAGAA TTTAAAATCA AGTTCTCCTG CATGAGCCCT CATCCTGTAA CGGACATTAT GGTTCCTGTT ATGAAAGACG ATTATGCAAA GCTGGGAATA GATGTTACTG TTGAGAATCT TGATTGGCCG ACTCTTTATC AAAAGGCAAC TAAAAAGCAG CTGGATGCTT ATTTTATGGC AAATGGACTT ACTCCGGATC CTGACAATTC ATTAGCAAAT GCATACAAAT CAGATGCATC TCAAAATTAT TATAATTACA AAAATAACGA AGTTGATAAG CTTTGTGAAG AAGGTCTCAA AGAAATAAGC ACAGAAAAGA GAAAGCCCAT TTACAAGGAA CTATACAAAA TCTTGAATAA CGACTTACCT GTACTTTTTG TATATCAGAG AAGTGACATG TGGGTAGCTA ACTCCAGAAT AAAAAACTAC GAACTTTCTT CTTTCAGAGA TTTTTTCTAT AACTTATATA AAGCCGAAAT TGGAAAGTAA
|
Protein sequence | MKKKLAILLI VLSIILTVPA CSSKSDSGDT SGGSSTGTKV SQTGFDYKKY GVEYTASTDT AKSPKVATDR KDTLVVGLPD TTGIFNYLYG DNAYDWFAIY TMFDFNIDVD FDGKAIPGAT DYTISEDGLT YTFKIKDGVK FWDGNPATAS DLEFAYYLEA DPKYDGPSDI SKTFIKGLDP YKNGNADKIE GIKVLDDKTL QITVDKASGP AIYALQVPLL EKKYYGADFK KGDTAKVKEK NGAPMGTGQY KFVEYKAGQE LKLVANENYF KGAPKIKNLI FSVTPTGQEL QRVMAGETDI DMADVSPDNM KAAKDAGFID IYRFATNGYG FVGLNDADPK FSDVKVRQAL MYALNRAAVV EKVYGEYARV VNIPESNVSW AYDDEGCNTY EYNLDKAGQL LDEAGWKLNS NGKREKDGKE FKIKFSCMSP HPVTDIMVPV MKDDYAKLGI DVTVENLDWP TLYQKATKKQ LDAYFMANGL TPDPDNSLAN AYKSDASQNY YNYKNNEVDK LCEEGLKEIS TEKRKPIYKE LYKILNNDLP VLFVYQRSDM WVANSRIKNY ELSSFRDFFY NLYKAEIGK
|
| |