Gene Ccel_1768 details


Gene Information

Locus tag: Ccel_1768
Symbol:
ID: 7310502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 2115803
End bp: 2117542
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 38%
IMG OID: 643608699
Product: extracellular solute-binding protein family 5
Protein accession: YP_002506099
Protein GI: 220929190
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
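
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical helpers introduced for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 2115803
end_bp = 2117542
gene_length_bp = 1740
protein_length_aa = 579


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

print(computed_length, computed_protein)  # 1740 579
print(computed_length == gene_length_bp)       # True
print(computed_protein == protein_length_aa)   # True
```

Under these assumptions the reported gene length (1740 bp) and protein length (579 aa) agree with the coordinates.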


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000449992
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA AATTAGCAAT ATTGTTAATT GTATTGTCAA TTATACTGAC GGTACCTGCA 
TGCTCAAGTA AATCTGATTC TGGAGATACT TCCGGCGGCT CATCAACAGG AACAAAGGTC
AGTCAGACAG GATTTGATTA CAAGAAGTAC GGAGTTGAAT ATACTGCTTC TACTGACACC
GCAAAAAGTC CCAAGGTTGC AACAGACAGA AAGGACACAT TAGTTGTTGG ATTGCCGGAT
ACAACAGGAA TATTTAATTA TTTGTACGGC GATAATGCTT ATGATTGGTT TGCGATCTAT
ACCATGTTTG ATTTTAATAT AGACGTTGAT TTCGATGGTA AGGCGATACC CGGTGCCACT
GACTATACAA TTTCGGAGGA TGGGCTGACA TATACTTTTA AAATAAAAGA CGGAGTTAAG
TTCTGGGATG GAAACCCTGC TACAGCCTCT GATTTGGAGT TTGCATACTA TCTGGAAGCT
GATCCCAAAT ATGATGGACC TTCGGATATA TCAAAAACAT TCATAAAAGG TCTTGATCCA
TATAAAAATG GAAATGCTGA CAAAATCGAA GGAATAAAGG TGCTTGATGA TAAAACATTG
CAGATAACCG TTGATAAAGC CAGCGGCCCT GCAATATATG CGTTGCAGGT TCCGTTACTT
GAAAAGAAGT ATTACGGTGC TGATTTCAAA AAGGGTGATA CTGCAAAGGT AAAAGAAAAA
AACGGAGCAC CCATGGGCAC AGGTCAATAC AAATTTGTTG AGTATAAAGC GGGTCAGGAG
CTGAAACTTG TAGCCAACGA GAATTATTTC AAGGGAGCTC CTAAAATTAA AAATCTGATA
TTTTCAGTGA CACCGACAGG GCAGGAGCTT CAAAGGGTTA TGGCAGGAGA GACAGATATT
GATATGGCTG ATGTTTCACC TGATAATATG AAAGCAGCAA AGGATGCAGG GTTTATAGAC
ATATACAGAT TTGCTACAAA CGGTTACGGA TTTGTGGGAT TAAACGATGC TGATCCTAAA
TTCAGTGATG TGAAAGTACG GCAGGCTCTT ATGTATGCTC TTAACAGAGC TGCTGTTGTA
GAAAAGGTAT ACGGTGAATA TGCAAGAGTC GTGAACATAC CTGAATCAAA TGTATCATGG
GCATACGACG ATGAAGGGTG CAATACATAT GAATACAATC TTGATAAAGC AGGACAGCTG
CTGGATGAAG CGGGTTGGAA GCTAAACAGC AACGGAAAAC GTGAAAAGGA CGGCAAAGAA
TTTAAAATCA AGTTCTCCTG CATGAGCCCT CATCCTGTAA CGGACATTAT GGTTCCTGTT
ATGAAAGACG ATTATGCAAA GCTGGGAATA GATGTTACTG TTGAGAATCT TGATTGGCCG
ACTCTTTATC AAAAGGCAAC TAAAAAGCAG CTGGATGCTT ATTTTATGGC AAATGGACTT
ACTCCGGATC CTGACAATTC ATTAGCAAAT GCATACAAAT CAGATGCATC TCAAAATTAT
TATAATTACA AAAATAACGA AGTTGATAAG CTTTGTGAAG AAGGTCTCAA AGAAATAAGC
ACAGAAAAGA GAAAGCCCAT TTACAAGGAA CTATACAAAA TCTTGAATAA CGACTTACCT
GTACTTTTTG TATATCAGAG AAGTGACATG TGGGTAGCTA ACTCCAGAAT AAAAAACTAC
GAACTTTCTT CTTTCAGAGA TTTTTTCTAT AACTTATATA AAGCCGAAAT TGGAAAGTAA
 
Protein sequence
MKKKLAILLI VLSIILTVPA CSSKSDSGDT SGGSSTGTKV SQTGFDYKKY GVEYTASTDT 
AKSPKVATDR KDTLVVGLPD TTGIFNYLYG DNAYDWFAIY TMFDFNIDVD FDGKAIPGAT
DYTISEDGLT YTFKIKDGVK FWDGNPATAS DLEFAYYLEA DPKYDGPSDI SKTFIKGLDP
YKNGNADKIE GIKVLDDKTL QITVDKASGP AIYALQVPLL EKKYYGADFK KGDTAKVKEK
NGAPMGTGQY KFVEYKAGQE LKLVANENYF KGAPKIKNLI FSVTPTGQEL QRVMAGETDI
DMADVSPDNM KAAKDAGFID IYRFATNGYG FVGLNDADPK FSDVKVRQAL MYALNRAAVV
EKVYGEYARV VNIPESNVSW AYDDEGCNTY EYNLDKAGQL LDEAGWKLNS NGKREKDGKE
FKIKFSCMSP HPVTDIMVPV MKDDYAKLGI DVTVENLDWP TLYQKATKKQ LDAYFMANGL
TPDPDNSLAN AYKSDASQNY YNYKNNEVDK LCEEGLKEIS TEKRKPIYKE LYKILNNDLP
VLFVYQRSDM WVANSRIKNY ELSSFRDFFY NLYKAEIGK
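
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the database's own pipeline: the codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The helpers `clean`, `translate`, and `gc_fraction` are hypothetical names, and only the first and last 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing, uppercase the rest."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last 30 bases of the gene sequence listed above (both in frame).
first_block = "ATGAAGAAAA AATTAGCAAT ATTGTTAATT"
last_block = "AACTTATATA AAGCCGAAAT TGGAAAGTAA"

print(translate(first_block))  # MKKKLAILLI
print(translate(last_block))   # NLYKAEIGK
```

The two outputs match the first ten and last nine residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the overall GC content for comparison with the reported 38%.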