Gene Ccel_1232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_1232 
Symbol 
ID7310029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp1510665 
End bp1512137 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content41% 
IMG OID643608153 
ProductCarbohydrate binding family 6 
Protein accessionYP_002505568 
Protein GI220928659 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2382] Enterochelin esterase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.122742 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAAA ATATATTAAC TAAAAAAACG ATTTGGGGAA TGGTGGCATT TGTTTTTGCT 
TTAACTCTAG TTTTTACAGT GCCTGATTCC AAGGTGCAGG CTGCAGCACT GCCTACCACA
CCACCGACAG GCTATGACCG AGTCCAGAGT AACATTCCGC ACGGTCAGGT TAGCTATATT
AATTACCAAT CTAAGGCAAC AAACAGTCAA AGAAGAGCAA GGATTTACCT GCCTCCAAAT
TATTCAGCAG ACAAAAAATA CAGTGTAATG TATTTACTGC ATGGTATCGG TGGAAATGAA
GACGAGTGGT ACAATAACGG TGCACCTAAC GTTATTCTTG ACAATCTTAT AGCTGCAGGT
AAAATTCAGC CATTTATCGT TGTATTACCA AATGGCAATG CAACAGGAAC TGGTGTATCT
GATGGTTGGA CTAATTTTAC AAAAGATTTA ATCGAAAGTC TTATCCCATA TATAGAATCA
AACTACTCCG TTTATACGGA TCGTAATCAC AGGACTGTTT GCGGTCTCTC AATGGGTGGC
GGACAATCTT TCAATATCGG ACTTCCTAAC TTGAACCTGT TCCCATATGT TGGAGCATTT
TCACCGGCAC CTAATACACT CCCTAATTCC CAACTTTTCC CTAATGACGG AGCCTCAGCT
AAGCAGCTGT TGAAATTCTT GTTTATTTCT TACGGAACTA CCGACAGTCT GATTAGTTTT
GGTACAGGAG TACATAATTA CTGTGATTCC CAGAGTATTC CAAATACTTA CTTCCTTATT
CAGGGTGCAG GTCATGATTG GAACGTATGG AAGCAGAGCC TTTGGAATTA TTCTCAAATG
ATATGTGAAA AGGGTTTTAC AGACTATGGC CCTGTTGCAC CTGTATCAGC ATTTACACAA
ATTGAGGCAG AAAGTTTCAC CAGTCAGTCC GGCGTTCAGA CAGAAACCTG TACTGAAGGA
GGTCTGAATG TAGGGTATAT TGAGAATGGT GATTATGTAG TTTATAACAA TGTAGACTTT
GGCAGTGGTG CAACAAGCTT TCAGGCAAGA GTAGCAAGTG CTGGGAATGG AGGTAATATC
GAAATCAGAT TGGATAGTAT AACAGGTCCG TTGGTAGGAA CTTGTGCAGT TAAAGGCACA
GGCGATTGGC AGACTTGGAC TGATGCAAAG TGTACTGTAA GCGGAGTAAC CGGAAAACAT
GACTTGTATC TGAAATTTAC AGGTGGAAGC GATTATCTGA TGAATTTTAA CTGGTTCAAA
TTTGGTAATG CGACACAGAC TCTTACAGGT GATCTTAATG GAGATGCCAG TGAAGATGCA
ACAGACTATG CCTTGCTGAA AAAGTATCTT CTTGGTCAGA TTAATGACTT CCCTGTCGAA
GACGACATTA ATGCTGGAGA TATGAATAAG GACGGTGTAA TTGACGCTCT CGACTTTGCT
GTCTTTAAGA AAATCCTTTT GGGCACAATC TAA
 
Protein sequence
MIKNILTKKT IWGMVAFVFA LTLVFTVPDS KVQAAALPTT PPTGYDRVQS NIPHGQVSYI 
NYQSKATNSQ RRARIYLPPN YSADKKYSVM YLLHGIGGNE DEWYNNGAPN VILDNLIAAG
KIQPFIVVLP NGNATGTGVS DGWTNFTKDL IESLIPYIES NYSVYTDRNH RTVCGLSMGG
GQSFNIGLPN LNLFPYVGAF SPAPNTLPNS QLFPNDGASA KQLLKFLFIS YGTTDSLISF
GTGVHNYCDS QSIPNTYFLI QGAGHDWNVW KQSLWNYSQM ICEKGFTDYG PVAPVSAFTQ
IEAESFTSQS GVQTETCTEG GLNVGYIENG DYVVYNNVDF GSGATSFQAR VASAGNGGNI
EIRLDSITGP LVGTCAVKGT GDWQTWTDAK CTVSGVTGKH DLYLKFTGGS DYLMNFNWFK
FGNATQTLTG DLNGDASEDA TDYALLKKYL LGQINDFPVE DDINAGDMNK DGVIDALDFA
VFKKILLGTI