Gene Ccel_0044 details


Gene Information

Locus tag: Ccel_0044
Symbol:
ID: 7312055
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium cellulolyticum H10
Kingdom: Bacteria
Replicon accession: NC_011898
Strand:
Start bp: 46249
End bp: 47865
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 38%
IMG OID: 643606972
Product: extracellular solute-binding protein family 5
Protein accession: YP_002504412
Protein GI: 220927503
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
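
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. Neither convention is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 46249
end_bp = 47865
gene_length_bp = 1617
protein_length_aa = 538

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is
# not counted in the protein length.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(span_bp % 3 == 0)                          # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions the three fields agree: 1617 bases form 539 codons, of which 538 encode amino acids.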


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAA TATTAGCTCT TGTGCTTTCG CTGGCAACTA TAATAACCGT ATTTGCCGGT 
TGTGGAAGTC CTTCTTCCAA TGATACTAAA GGCATAACTG CAAGTATCGC ATCAGAGCCA
GCATCTATAG ATCCTTCACT TAATAAATCT CTGGATGGCG GTACTTATAT TAACTATGCG
TTTGAGGGTT TAACCACTTA TGACAAAGAA GGTAAAATCG TTCCGGGAAC TGCTGAAAAA
TGGGATGTAA GTGCAGATAA CAAAGTTTAT ACATTCCATA TCCGTAAGGA TGCAAAATGG
TCAGACGGTA AAGCTGTAAC TGCAAATGAT TTTGTTTACT CATGGAAGAG AGCAGTTGAT
CCAAAAACAG CTTCCGACTA TGCTTATTAC CTGTATTTTG TTAAGAATGG TGAAGCTATC
AACACTAATG GTGCTGATAT CAATACATTA GGAATTAAGG CTATTGATGA TAATACACTT
GAAGTAACTC TGGAGAATGT TTGCCCATTC TTTACTGAAA TCGTAGCCTT CCCAACTCTT
GTTCCATTGA GAGAAGATGT TGTATCAAAA AATCCTGACA AATGGACTCT TGATCCAAAG
ACTTATATAG GAAATGGTCC ATATGTTATG ACAGATTGGA AGCACTCATC AAAGCTGGTA
TTTGAAAAGA ACGAAAACTA CTGGGGTAAA GACAATGTAG TTGCACCAAA GATTGAATGG
TTGTTAATGA ATGATCCAAA TGCTATTCTT GGTGCTTTCA AGAACAAACA GTTATCTTTC
GCAAGAAATA TACCGCATGA TGAAATTCCA GCAGAGAAAG ATGCCGGAAA TCTTCAGATT
TTCCCACAGC TTGGAACTTA CTATCTGGAT CTGCTCAATA CTAAAGCACC ATTTGATAAC
CCAAAGGTTA GAAAAGCAGT ATCTCTTGCA ATTGACCGTA ACTACATCAC TGAAAAGGTT
AGAAAAGCAG GCGAAACTCC TGCATCAGGA TTTGTACCAT ACGGTATAGC TGATGTCAGC
CAAGACCCTG ATTTCCGTAC AAAAGGCGGA GATTTCTACT CTGTTAAACC TGAAGATTAT
GAGAAGAATG TTGCTGAAGC AAAGAAACTC TTAGCTGAAG CTGGATATCC AGATGGAAAA
GGATTCCCTA AAATAACCTT TGGTTTAAAC ACAGGTTCAG GTCATGAAGC AGTTGCTGAA
GCAATTCAAC AACAGTTAAA GACAAATCTG GGAATCGAAG TTGAAATTCA AGCACAGGAA
TGGAATGTAT TCCAAGAGTC CAGAAAGAAT GGTTTGTTTG ATATCGCTCG TGACGGTTGG
ATTGGAGACT ACATGGATCC TTCAACATTT ATGGATCTGA TAACTTCAAA TAACCCTCAG
AATAACTCAA AATATAATAA CGCTGCATAT GATAAGGCAA TCGCTGATGC CAGAAAAGAA
ACCGATCCTG CAAAGCGCAT GCAATTGTAC CATGACGCAG AAAACCTTCT CATGGAAGAG
GCAGGAGTTG CACCACTATT CTTCTATACT GATCCACTCA TTATAGATAA GAACTTACAA
GACTATGTAG TAACCAAGCT CGGTTTTATT TATTTACAAT GGGCATCATT CAAATAA
 
Protein sequence
MKRILALVLS LATIITVFAG CGSPSSNDTK GITASIASEP ASIDPSLNKS LDGGTYINYA 
FEGLTTYDKE GKIVPGTAEK WDVSADNKVY TFHIRKDAKW SDGKAVTAND FVYSWKRAVD
PKTASDYAYY LYFVKNGEAI NTNGADINTL GIKAIDDNTL EVTLENVCPF FTEIVAFPTL
VPLREDVVSK NPDKWTLDPK TYIGNGPYVM TDWKHSSKLV FEKNENYWGK DNVVAPKIEW
LLMNDPNAIL GAFKNKQLSF ARNIPHDEIP AEKDAGNLQI FPQLGTYYLD LLNTKAPFDN
PKVRKAVSLA IDRNYITEKV RKAGETPASG FVPYGIADVS QDPDFRTKGG DFYSVKPEDY
EKNVAEAKKL LAEAGYPDGK GFPKITFGLN TGSGHEAVAE AIQQQLKTNL GIEVEIQAQE
WNVFQESRKN GLFDIARDGW IGDYMDPSTF MDLITSNNPQ NNSKYNNAAY DKAIADARKE
TDPAKRMQLY HDAENLLMEE AGVAPLFFYT DPLIIDKNLQ DYVVTKLGFI YLQWASFK
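
The sequences above are printed in blocks of ten characters, so the spacing has to be stripped before any computation. The following is a minimal sketch, not the database's own tooling: the helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and only the first line of each sequence is used to keep the example short. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard code, differing only in which start codons are permitted.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene and protein sequences, copied from the listing.
first_gene_line = "ATGAAAAGAA TATTAGCTCT TGTGCTTTCG CTGGCAACTA TAATAACCGT ATTTGCCGGT "
first_protein_line = "MKRILALVLS LATIITVFAG CGSPSSNDTK GITASIASEP ASIDPSLNKS LDGGTYINYA "

dna = clean_sequence(first_gene_line)
protein = translate(dna)

print(protein)  # MKRILALVLSLATIITVFAG
print(clean_sequence(first_protein_line).startswith(protein))  # True
print(f"GC fraction of this 60-base fragment: {gc_fraction(dna):.2f}")
```

Passing the full gene sequence through the same helpers would let the listed protein sequence and GC content be checked end to end. The GC figure printed here covers only the first 60 bases and is not the gene-wide value.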