Gene Information |
Locus tag | Ccel_0044 |
Symbol | |
ID | 7312055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium cellulolyticum H10 |
Kingdom | Bacteria |
Replicon accession | NC_011898 |
Strand | - |
Start bp | 46249 |
End bp | 47865 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643606972 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002504412 |
Protein GI | 220927503 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
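
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 46249
END_BP = 47865


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1617 538
```

Both results match the Gene Length (1617 bp) and Protein Length (538 aa) fields.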
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA TATTAGCTCT TGTGCTTTCG CTGGCAACTA TAATAACCGT ATTTGCCGGT TGTGGAAGTC CTTCTTCCAA TGATACTAAA GGCATAACTG CAAGTATCGC ATCAGAGCCA GCATCTATAG ATCCTTCACT TAATAAATCT CTGGATGGCG GTACTTATAT TAACTATGCG TTTGAGGGTT TAACCACTTA TGACAAAGAA GGTAAAATCG TTCCGGGAAC TGCTGAAAAA TGGGATGTAA GTGCAGATAA CAAAGTTTAT ACATTCCATA TCCGTAAGGA TGCAAAATGG TCAGACGGTA AAGCTGTAAC TGCAAATGAT TTTGTTTACT CATGGAAGAG AGCAGTTGAT CCAAAAACAG CTTCCGACTA TGCTTATTAC CTGTATTTTG TTAAGAATGG TGAAGCTATC AACACTAATG GTGCTGATAT CAATACATTA GGAATTAAGG CTATTGATGA TAATACACTT GAAGTAACTC TGGAGAATGT TTGCCCATTC TTTACTGAAA TCGTAGCCTT CCCAACTCTT GTTCCATTGA GAGAAGATGT TGTATCAAAA AATCCTGACA AATGGACTCT TGATCCAAAG ACTTATATAG GAAATGGTCC ATATGTTATG ACAGATTGGA AGCACTCATC AAAGCTGGTA TTTGAAAAGA ACGAAAACTA CTGGGGTAAA GACAATGTAG TTGCACCAAA GATTGAATGG TTGTTAATGA ATGATCCAAA TGCTATTCTT GGTGCTTTCA AGAACAAACA GTTATCTTTC GCAAGAAATA TACCGCATGA TGAAATTCCA GCAGAGAAAG ATGCCGGAAA TCTTCAGATT TTCCCACAGC TTGGAACTTA CTATCTGGAT CTGCTCAATA CTAAAGCACC ATTTGATAAC CCAAAGGTTA GAAAAGCAGT ATCTCTTGCA ATTGACCGTA ACTACATCAC TGAAAAGGTT AGAAAAGCAG GCGAAACTCC TGCATCAGGA TTTGTACCAT ACGGTATAGC TGATGTCAGC CAAGACCCTG ATTTCCGTAC AAAAGGCGGA GATTTCTACT CTGTTAAACC TGAAGATTAT GAGAAGAATG TTGCTGAAGC AAAGAAACTC TTAGCTGAAG CTGGATATCC AGATGGAAAA GGATTCCCTA AAATAACCTT TGGTTTAAAC ACAGGTTCAG GTCATGAAGC AGTTGCTGAA GCAATTCAAC AACAGTTAAA GACAAATCTG GGAATCGAAG TTGAAATTCA AGCACAGGAA TGGAATGTAT TCCAAGAGTC CAGAAAGAAT GGTTTGTTTG ATATCGCTCG TGACGGTTGG ATTGGAGACT ACATGGATCC TTCAACATTT ATGGATCTGA TAACTTCAAA TAACCCTCAG AATAACTCAA AATATAATAA CGCTGCATAT GATAAGGCAA TCGCTGATGC CAGAAAAGAA ACCGATCCTG CAAAGCGCAT GCAATTGTAC CATGACGCAG AAAACCTTCT CATGGAAGAG GCAGGAGTTG CACCACTATT CTTCTATACT GATCCACTCA TTATAGATAA GAACTTACAA GACTATGTAG TAACCAAGCT CGGTTTTATT TATTTACAAT GGGCATCATT CAAATAA
|
Protein sequence | MKRILALVLS LATIITVFAG CGSPSSNDTK GITASIASEP ASIDPSLNKS LDGGTYINYA FEGLTTYDKE GKIVPGTAEK WDVSADNKVY TFHIRKDAKW SDGKAVTAND FVYSWKRAVD PKTASDYAYY LYFVKNGEAI NTNGADINTL GIKAIDDNTL EVTLENVCPF FTEIVAFPTL VPLREDVVSK NPDKWTLDPK TYIGNGPYVM TDWKHSSKLV FEKNENYWGK DNVVAPKIEW LLMNDPNAIL GAFKNKQLSF ARNIPHDEIP AEKDAGNLQI FPQLGTYYLD LLNTKAPFDN PKVRKAVSLA IDRNYITEKV RKAGETPASG FVPYGIADVS QDPDFRTKGG DFYSVKPEDY EKNVAEAKKL LAEAGYPDGK GFPKITFGLN TGSGHEAVAE AIQQQLKTNL GIEVEIQAQE WNVFQESRKN GLFDIARDGW IGDYMDPSTF MDLITSNNPQ NNSKYNNAAY DKAIADARKE TDPAKRMQLY HDAENLLMEE AGVAPLFFYT DPLIIDKNLQ DYVVTKLGFI YLQWASFK
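
The relationship between the two sequences can be illustrated with a minimal sketch. The gene sequence as listed begins with ATG and ends with TAA, so it appears to be given in the coding orientation even though the gene lies on the minus strand, and no reverse complement is applied here. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons), so a standard codon table is used. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 bases of the record are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# translation table 11 shares these assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
prefix = "ATGAAAAGAA TATTAGCTCT TGTGCTTTCG"
print(translate(prefix))  # MKRILALVLS
```

The output matches the first ten residues of the protein sequence above. Applying `gc_fraction` to the full gene sequence would give a value to compare against the GC content field.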