Gene Ccel_0608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCcel_0608 
Symbol 
ID7312125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium cellulolyticum H10 
KingdomBacteria 
Replicon accessionNC_011898 
Strand
Start bp702983 
End bp704632 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content35% 
IMG OID643607548 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002504969 
Protein GI220928060 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.995783 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACAA AAGCCTTGAG ACTTATTTCT TTATTCATAA TTACAGGACT GATGCTATCT 
GCATGTTCTG TTAATCTTAA TAAAAAATCT GCCAATCAAG ATGAGGATAT TTATGAAGGT
AAATATGATA TATTGGACAA AGGCCCTGAA AAAGGAGGGT CTATACGTCT GTTCAGTACA
CCTGTAGATA CTTTAAATCC AATTTTGACT AATAACCAGT ATGTTCAGGA TTTTTTGGGA
TTTGTATTTG AAGGACTCTA TAGATTAGAC GAAAAGCAGC AGCCTGTGCC TGTTTTAGCA
GAAAGAGCAG TTACTTCAGC TGACGGATTA AAACTTACAG TAACTTTAAA AAAAGGAATT
AAATGGCACA ATGGATTACC GCTTCAAGCC GGAGATGTAG TGTTTACTAT AAATAGTATA
ATGGATACTA AGAACAGCAG CGTGTATGCA GCTAACTTAC AGAATATCGC TTCTGTAACT
GCGGGAAATA ATAATTCAGT TGTAATTACG TTGAAAAAAC CTGATTCAAT GCTGTTATAC
AGCCTGACCT TTCCCGTTAT ATCTATGCAG TATTTTAATA AAGAAAAATT GAGTGATAAA
AATTCAAAGA AAAATCTCTC ACCTGTAGGT ACGGGACCTT ATACTTTTGT ATCATATAAT
GCAAAAAACG GAGTAAAATT TAAAGCCAAC GATGATTGGT GGAACAAAGG CAATTCAGAA
GTAACGACTC CCTATATCCA ATCATTGGAG ATTAAAATAT TCGAGAATGC CGGGAAAGCC
ACTAAGGTCT TTCAGTCCAG GGATGTTGAT GTGGTTACGG TTGATCACAG TGAGTTTAAA
AAGTATATCA ATCGTACTGA TATTTCACTC AAACGTTATC CCGGTAAAAA CTATGAATTT
CTATCACTCA ATGTTACAAA AGGGCCAATG GCAAATAAAA ATTTGAGAAG TGCTTTGGGT
GGATTTATAG ATAAGAAAAA GCTTATTGAT ACTGCAGTAC AGGGGATTGC GATACCTGCT
GAATTACCGC TTTTCCCTAA CTCTTGGATA AATCAGTTGG TAAATATGGA ACAGTATTCA
GACTTAAAAA AGGCGAAACA GCTTATGACA CAAAGCGGAT ATGTTCTTTC GAAAAATAAG
TATGTAAGCA AAGCAAACAG TAGAGCATTG TCATTAAAGC TTATTGTTAA TCAGGATAAC
ACATTAAGAG TAAATACTGC CGATGCTATC GCATCTCAAT TGGTTAAAAA TGGAATAAAT
GTGGAGGTTG AAAAGCTGAC TTGGGAGAAT GTGCAAAAAC GAATAAAATC CGGTGCATAT
GATATGGCTT TACTGGGATA TCAAATTTCA ACAAAACCGG ATTTGTCCTT TGCTTACTCT
ACAGATAGTA TAGAGTCAGG GCTCAATACG GCAAAGTACA GCAACCCTGC TGTTGACGGG
TATCTTCAAC AAATTTTAAC TCAATCTGAC ATTGAAAAAC AGAAAAGTTT ATATACCAAA
CTTTTAAATA CTGTTCTTGA CGAAAGGCCG TACATAGGCT TATATTTTAT CTCCCAAGGT
ATAATGTGCA GTAAAAATAT TAAAGGAGCG ATAAACCCTA ATGTATGGAA CAGTTATAAC
GATATTTCAC AGTGGTATGT ACCGCAATAA
 
Protein sequence
MMTKALRLIS LFIITGLMLS ACSVNLNKKS ANQDEDIYEG KYDILDKGPE KGGSIRLFST 
PVDTLNPILT NNQYVQDFLG FVFEGLYRLD EKQQPVPVLA ERAVTSADGL KLTVTLKKGI
KWHNGLPLQA GDVVFTINSI MDTKNSSVYA ANLQNIASVT AGNNNSVVIT LKKPDSMLLY
SLTFPVISMQ YFNKEKLSDK NSKKNLSPVG TGPYTFVSYN AKNGVKFKAN DDWWNKGNSE
VTTPYIQSLE IKIFENAGKA TKVFQSRDVD VVTVDHSEFK KYINRTDISL KRYPGKNYEF
LSLNVTKGPM ANKNLRSALG GFIDKKKLID TAVQGIAIPA ELPLFPNSWI NQLVNMEQYS
DLKKAKQLMT QSGYVLSKNK YVSKANSRAL SLKLIVNQDN TLRVNTADAI ASQLVKNGIN
VEVEKLTWEN VQKRIKSGAY DMALLGYQIS TKPDLSFAYS TDSIESGLNT AKYSNPAVDG
YLQQILTQSD IEKQKSLYTK LLNTVLDERP YIGLYFISQG IMCSKNIKGA INPNVWNSYN
DISQWYVPQ