Gene Ccel_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1987 
Symbol 
ID7312309 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp2352953 
End bp2354089 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content40% 
IMG OID643608922 
Productputative solute-binding component of ABC transporter 
Protein accessionYP_002506315 
Protein GI220929406 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG TAATCGCTTT GATTTTAGTA GCTGTCCTTG CTGTGGGTAT GCTGGCAGCT 
TGTGGTACTT CAACCTCAAC TGATGCTTCC GTATCTTCAT CAGCAGCATC TACTGATTCT
GCAAAAGGCA CCGATACAGC GGCAAGCGGG CAGTTAATCG GTGTAGCAAT GCCTACTCAG
TCATTACAGC GTTGGAACCA GGATGGTGCC AATTTGAAGA AGGAACTTGA AGGAAAAGGC
TACAAGGTTG ATCTTCAGTA TGCTAACAAC GATGTAAACA CTCAGATTCA GCAGATTGAA
AACATGATTG TTAAGGGCAG TAAAGTTCTT GTTATCGCCG CTATTGACTG TTCAGCACTT
TCTGACGTTT TGAAAAAAGC AGCAGATAAC AGTGTAAAAG TAATATCTTA TGACAGACTT
ATAATGAAGA CTCCAAACGT AGACTATTAT GCAACATTTG ATAATTTCAA GGTTGGGGTG
ATTCAAGGTC AATATATTGA AACAAAATTG GGCCTTAAAG ATGGCAAAGG ACCATTCAAT
ATTGAACTTT TTGGTGGATC ACCTGATGAT AATAATGCAA ATTACTTCTT TGACGGTGCA
TATAGTATTT TAAAACCATA TATAGACAGC GGTAAGCTGG TTGTCACAAG CGGTCAGAAA
GATTTTGCAA AAATCGCAAT CCAGGGTTGG GATTCTGCAA AAGCACAAGC AAGAATGGAT
AATTTGATTA CAGCTAATTA TGCAGGCGGT AAAAAACTTG ATGCAGTTCT TTCACCAAAT
GACAGCCTTG CAATCGGTAT TGTTGCATCA CTTAAAAATG CAGGCTATGG TAGCAGTGAC
AAGCCATATC CGATTATTAC CGGACAGGAC TGCGACAAGC CAAATGTTAT AGCAATGATT
AATGGACAGC AGTCAATGTC AATATTCAAG GACACAAGAA CTCTTGCATC AAAGGTTGTT
GAAATGATTG ACTCCCTGTT ACAAGGAAAA GAAGCTCCTG TTTACGACAC TAAGACTTAT
GACAACGAAA GTAAAGTTGT TCCTTCATTC CTTTGTGAAC CCGTATACGC TGACAAAGAC
AACTACAAGA AAATTCTCGT TGACAGCGGT TATTACAAGG AATCTGACCT GAAGTAA
 
Protein sequence
MKKVIALILV AVLAVGMLAA CGTSTSTDAS VSSSAASTDS AKGTDTAASG QLIGVAMPTQ 
SLQRWNQDGA NLKKELEGKG YKVDLQYANN DVNTQIQQIE NMIVKGSKVL VIAAIDCSAL
SDVLKKAADN SVKVISYDRL IMKTPNVDYY ATFDNFKVGV IQGQYIETKL GLKDGKGPFN
IELFGGSPDD NNANYFFDGA YSILKPYIDS GKLVVTSGQK DFAKIAIQGW DSAKAQARMD
NLITANYAGG KKLDAVLSPN DSLAIGIVAS LKNAGYGSSD KPYPIITGQD CDKPNVIAMI
NGQQSMSIFK DTRTLASKVV EMIDSLLQGK EAPVYDTKTY DNESKVVPSF LCEPVYADKD
NYKKILVDSG YYKESDLK