Gene Ccel_3463 details


Gene Information

Locus tag: Ccel_3463
Symbol:
ID: 7312021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 4032256
End bp: 4034331
Gene Length: 2076 bp
Protein Length: 691 aa
Translation table: 11
GC content: 37%
IMG OID: 643610373
Product: Carbohydrate-binding family 9
Protein accession: YP_002507731
Protein GI: 220930822
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2382] Enterochelin esterase and related enzymes
TIGRFAM ID:
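
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below shows one way to check them, assuming the coordinates are 1-based and inclusive and that the CDS length includes a single stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4032256, 4034331)
print(length)                     # 2076
print(protein_length_aa(length))  # 691
```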


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0207696
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGCAAGG AAAAGAAGAA ACCGCTAAAA CACTTTATTA TTGTGCTGGC TGTTTTCTCA 
ATGGTATTCA CGAATTTTAT CGGTTCGTGG TCAGGTGTCT CCGCAGCTCC TGCAAAGGGT
AAGCCTTCAT TTAAACAAGC AGCTTCTAAA GAGCTGAAGG TGGGGCAAAC CTACACTTTT
GATATCATCA ATAAGCAGAC AGGTGCAACC TACCAGTGGA GCAGCAGCAA CAAATCAATT
GCTACAGTAA CCCAAAAAGG GCAAGTTAAG GCAATCTCTG TGGGTACTGC AACAATTTCC
TGTAAAATAA CTAAAAACAA AAAGTCAACT ACAATTAAGG CAAAAGTCAT TGTTAAAAAG
GAAAAGGCCC CTGCTAAACC TGCCCCACAA CCCGCAAAAG TATCGGTAGA CAAAAATGGA
TTTGATTCCA AAGGAAGAAT GGTCGCTTAT TTTGGTTCTC CAGTGATTGA CGGTAAAGTG
GATGCTGCTT GGAGTAAAGC ACAAGTTGTT ACTCCGAAAT ACGTAACAGA TAATATTGGT
ACAACAGCTA CTTTCAGAAC TCTATGGGAT GATAATGCAA TCTACATACT TGCAGAAGTA
AAGGATAAAA ATATGTCTGT TGAATCCGAC ACTCCATATA TGCAAGATTC GTTAGAGATA
TTTTTAGATG AGAATAATGA TAAAACACAA GAATATGGCA TTGATGACTT ACACTTCCGT
GTTAATTATA AGAATTCCCA GTCTGTGGAT GTTGGAAATG CCGAACGCTT TTACACCAGT
ACAAAAAAAG TGAAGGGTGG ATATCTAGTA GAAGCAAGAG TTGCTTTGAA TGCCAAACCC
TCAGACGGAA AGGTACTAGG AATTGAACTG CAAATTAATG AAGCAAAAGA TACTGAAAGA
ATCGGTACTA TTAATATTTT TGATTCTACA GGCAGTGCAT GGAACGACAC CAATAAATTT
GGTGAAGTAT TGCTTACAGG CAAGGCCAAA GGAGCGGTTA GTGGATTGAA CCCATATGAT
CTTATAAAAC TGGTAGAAAG TACTCAAAAA ATTGATTTCA CACTTTATAA AAATCCAAAT
GTGGTAAAAG ACGCTGTTAA ATCTGCTCAA AAGGTAATTG CTAATAAAAA AGCCACCCAG
AAACAAATTG ATGCACAATA TTCCGCAATA AAAGCAGCAA TCGGTACATT AGTTCTCACT
GATGAAGCTG CAAATGAAAA ATACTTTAAA GCAGTCCCGG ATAATTATAG AACGGAAAAT
GATAAACAAG GTACTATTGT AAGCTTAGAG TACTCTGCGG ATAATTTAAA AAATGGAACA
GACGTTAAGA AGATGAATGT TTACCTTCCT TACGGTTATG ATGCATCCGA TAAAAATAAA
AAATATAATG TTTTATATTT AATGCATGGC GGTGGTGAAA ATGAAAATCT CATTTTTGGA
GGACCTGCCC AAAACAAAGA ACTAAAGAAA ATCATAGATA ATATGATTTC AAATGGAGAT
ATCGAACCGC TAATTGTTGT AACACCTACA TTTTATGGTG GAAAAGATGA TACTGCACTC
TTCCATGAAG AACTGATGAA AAATGTTGTT CCTTTAGTAG AGACTAAGTA TAATACATAT
ACAAAATCAG CAAGTCTGGA AGATTTGAAG GCTTCTAGAG AACATAGAGC ATTTGGCGGA
TTTTCTATGG GTTCGGTAAC TACCTGGTAT ACATATATCA ATTGCTTGGA CTATTTCAAA
TATTTTATTC CGATAAGCGG TGACTGTTGG GTATTTGGTC AGAAAGCCGG AAGCGAGAAG
TCAAAAGAAA CAGCAGAATA TCTCGCTAAG GTTGCAAAAG ATTCCGGTTA TAGCCCACAG
GATTATTACT TGTTCTGTGC TACAGGAAGC TTGGATATTG CTTATCCTAA CTTAAAGCCT
CAGGTGGATG CTATGAAGCA ATTAAAAGAC ACCTTTATTT ATTCATCTGA TACAACAAAA
GGAAATTTCT ATTTCATAGT AAGCGATGGT GGTACACATG CTTGGAATTG GGTTAATCAG
TATATTTATG ATATATTGCC TGATCTATTT AAATAG
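
The GC content field in the gene information can in principle be recomputed from a sequence block formatted like the one above. The helper below is a minimal sketch, assuming the block uses only the letters A, C, G and T separated by spaces and line breaks; `gc_percent` is a hypothetical name, and the short input is just the first two ten-base groups of the gene sequence.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base groups of the gene sequence: 8 of 20 bases are G or C.
print(gc_percent("ATGGGCAAGG AAAAGAAGAA"))  # 40.0
```

Passing the full gene sequence as a single string gives a value that can be compared with the reported GC content after rounding.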
 
Protein sequence
MGKEKKKPLK HFIIVLAVFS MVFTNFIGSW SGVSAAPAKG KPSFKQAASK ELKVGQTYTF 
DIINKQTGAT YQWSSSNKSI ATVTQKGQVK AISVGTATIS CKITKNKKST TIKAKVIVKK
EKAPAKPAPQ PAKVSVDKNG FDSKGRMVAY FGSPVIDGKV DAAWSKAQVV TPKYVTDNIG
TTATFRTLWD DNAIYILAEV KDKNMSVESD TPYMQDSLEI FLDENNDKTQ EYGIDDLHFR
VNYKNSQSVD VGNAERFYTS TKKVKGGYLV EARVALNAKP SDGKVLGIEL QINEAKDTER
IGTINIFDST GSAWNDTNKF GEVLLTGKAK GAVSGLNPYD LIKLVESTQK IDFTLYKNPN
VVKDAVKSAQ KVIANKKATQ KQIDAQYSAI KAAIGTLVLT DEAANEKYFK AVPDNYRTEN
DKQGTIVSLE YSADNLKNGT DVKKMNVYLP YGYDASDKNK KYNVLYLMHG GGENENLIFG
GPAQNKELKK IIDNMISNGD IEPLIVVTPT FYGGKDDTAL FHEELMKNVV PLVETKYNTY
TKSASLEDLK ASREHRAFGG FSMGSVTTWY TYINCLDYFK YFIPISGDCW VFGQKAGSEK
SKETAEYLAK VAKDSGYSPQ DYYLFCATGS LDIAYPNLKP QVDAMKQLKD TFIYSSDTTK
GNFYFIVSDG GTHAWNWVNQ YIYDILPDLF K
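
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this with a hand-built codon table; it assumes the amino acid assignments of NCBI translation table 11, which match those of the standard table (the two differ only in permitted start codons). `translate` is a hypothetical helper, not part of any database tool.

```python
from itertools import product

# Codons enumerated in T, C, A, G order at each position,
# paired with the corresponding one-letter amino acid codes.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGGGCAAGG AAAAGAAGAA ACCGCTAAAA"))  # MGKEKKKPLK
# Last 15 bases end in the TAG stop codon.
print(translate("GATCTATTTAAATAG"))                   # DLFK
```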