Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_1371 |
Symbol | |
ID | 4809366 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 1664451 |
End bp | 1670330 |
Gene Length | 5880 bp |
Protein Length | 1959 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640106795 |
Product | YD repeat-containing protein |
Protein accession | YP_001037796 |
Protein GI | 125973886 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAA AAGTATTGCC GGAAAAGATG ACTGAAATTG CGAGCAGTTT AAAGCGCTTG TCCGATGAAT TTGATGGTAT AATACGGGAT ATAAACAGTA TAGTAAAGTC AATTGACTGG GAACTGAGAA GCAAAGAAGG GGTAGATCAG AAAGCGTCGG ATGCGATTAG AGTTGCAAAG AAAATATCCG GAAGTCTTGA GTCAATGGCA AAAGACCTTG AGTTTGCCCG AGACAGAATG ATTGAAGAAG ATAAAAAAGC ATCAAACATT GCCGGGAAAA TGAAGAGTGC GGTCATCGGA GCTTCTGTGG CTACAGCTTC CGCCGGAAAT GCAAAAGTTG CGGATGTGAA TTATACATCG GATTACAAAA ACTTAGGTCC CGGACGAGCA AACTGCCCGA ATACCTTTGT CGGTGACCCT GTAAACGTAA CAACGGGAAA CTTCTATGTG ACTAAAAAGG ATATCCAAAT CCCCACAAGA GGTATTCCCC TTGAAATAAG GCGATATTAC AATTCCATTG ACCAAACCTG CGGAATTTTG GGAAAGGGCT GGAGAATAGG ATATGAAACC GGCCTTATGA GAGTGGAAGA CAGTCAGGAT ATCCTGACGG TGTATCCTGA CGGAAGCATA GGCGTATTTG AGTTTGATGA AAAAGACAAT AAATATATTG CCCCACGGGG AATTTTTGAT ATACTACAAA AGAACGAAGA TGAAAGCTTT ACGTTGAAAC TTCATGACGG AACTACCTAC AGATATGACA AATCCGGAAA TCTCACATCA ATAAGCGACC TTAACGGTAA TGCAATTTCC ATAAAATATA ACGGCCAAGG TATGATTTCA TCGGTAATAT CACCGGGCGG AAAATTTCTG GATTTTTCAT ATGAAGATTT AAAACTGAGA AAAATAACTG ACCATACCGG TCGGGAGATA ATTTACTCCT ATGATAAATC CGGCAACCTT ATTCAGGTAA AATATCCTGA CGGAGGAATC ATTAAATACG GATATGACAA TAAAGGCATG ATATCTATCA CCGACCAGAA CGGAAACACA TATGTCCAAA ATACATACGA CGAAGCAGGC CGGGTTGTAA AACAACTTGA CCATGAAGGA AACGAGCTGG TTATAGAATA CTGCCCGGGA GAGTGCAAGA ATATATTTAA ATGGCAGAAA AGCGGTATAA CCCGCATATA CAAATACAAC GAGGAAAAGT TGCTGACGGA AATCATATAT GATGACGGTA CCAAAGAAGT ATACACCTAT GATGAGGATA AAAACAGAAA CAGCATAACC GACAGAAATG GCAGGACAAG AAGGTTTAAG TATGATGAAA GAGGGAATCT CATAGCCGAA ATAATGCCGG AACCTTTCAA CTATACCGTA CGATACAGCT ATGACGGAAA CAACAGAAGG ACAAAAATCA GCACTCCGGC AGGCGGTCTT GCAAGATTTG AGTATGACGA AAAAGGAAAT CTCTTAAAAC GTATTGTGAA AATAAACAGC AATACCAATT CCGAAACTGT GTATACATAT GATGAATTCG GAAGAGTTTT GACAATTACC GATGCCGAGA ACAACACCAC ATCCTTTGAA TATAATGATG ACGATATAAA CAAGCCCTCT GCAATAATTG ACCCGGAAGG AAACAGATTT ACCTATGATT TTGATGCCAT TGGAAGAGTT GTGGCGATAA CAACGGGTTA TGGAACTGTA AAAATAGAAT ATAATGAGCG TGATAAGATA ACAGGTTTGA TTGATGCGGA AAATAACAAG ATCCGAATAA AATATGATGC CGTCGGAAAC ATGGTTGAAG TTATTGCTCC GGAACAATAT GCGCAAAAAG GCGATAAAGC TCAAAGCTAT ACCTTTGCTT ATGATGCAAT GGACCGTATG ATAAAGCAGA TTGACCCCTT GGGAAACGTA TTTATGGTAA AATACGATGA GCACGGAAAT AAAATCAAGG AGGTCAATCC CAATTACTAC AATGAAGAGG AAGACGACGG CATAGGAATG GTCTATAAAT ATGACTCCAG CCATAGGCAA ATAAACACAA TTTTCCCTGA CGGAAGCAAA TCCAGAATAA AGTATGACCC TCAAGGGAAT ATAATAAAAA CCATATTGCC GGGCGATTAT AACGAAGAAA CCGATGACGG TCCGGGAATG CAGTTTACGT ATGATGAAAT GGACCGGCTT GACAAGATCA TTGACCCGAA TGGAAATGTA ATTGCAAAAT ATTTGTATGA TGAGGACGGC AGAGTAATAA AGGAAATAAA TGCAAAAGGG TACAGTAGCG CGGACAATGA TGAAGAACGC TGGGGCACAT TGTACAAATA TAATCTTGCC GGCTGGCTTG TTGAAAAAAG GACTCCCCTT GAAAGCATAA ACGGGCAAAT CTTTTACAAT GTAACTGAAA ATGTCTATGA CAGAAACGGC CGGCTTGTAC AGCAAAAAAT TTCTCCTGAA TATGTGACCA AAACCGGCTA TCCAAAGACA TGGAATATCA TAAGCTATAA GTATGACAAA AACGGAAGAA TTATAGAAGT ATCGGACAGC ATTGGTTCAT GTGTGCAGTA TGAGTACGAC TGTCTTGGTA AGAAGACTTT GGAAAAGAAA AAAATAAATG ACAATACATA TAAAATAACC AGATTTGAAT ACAGCAGTGC CGGAAGACTC AAAAAAGTAA TTGAAGAGTA TGATGGAAAA GATTTGGCGG GCGAAGAAAA AGGAACCGTG AAAGCAGAGA CCTTGTTTGA GTATGACAGA AACGGCAATA TTACAGCGGT TATTTCCCCG GAAGGGTACA GAAAAAGATT CATTTATGAT GCGGCAAACA GAAGAATAGG TGTTGAAGAA TATTTGCCTC CGGAAGGAGT GAAAATATCA GGAAATGCCG TCAATGCACT GTTAAAAACT ATAGTCAGAA AGACACTGTA CAGCTATGAT AAAGCCGGAA ACCTTGTACA GCAGAGGCTT CCCAACGGAA GAACCGTTGA AACCGAATAT GACGAAATGA ACAGAAGAAT CAGGATAAAA GATGCGGAGG GAAATATAAC AAGGCTGTTT TATGATGCGT CAGGCAATTT AATAAAATAT GTGGAACCGG AAAATTACGA TCCAAAAATC GATGACGGTC CGGGAACATC CTATTTTTAT GATTCAATGA ATCGGCTTTT ACAGGTGACA AACGCGGCGG GTATTGTTGT GGAAAGAAAT ATATACAACA CGGCCGGAGA AATAATAAAG AAAATTGACG CAAGAGGATA TCTGGCTGCA GCGAATGATA ATGACAGATA TGGTGTGGAA TACGGATATG ATCCGGGAGG AAGACTTAGA TATATTACAA CGCCGGAGGC AAAAGCCAAA GGAATTGTAA GCCAGCAATA CAACTATAAC TCTTTGGGGT ATATTACGGA AATTATCGAC GGAAAAGGAA ACAAAACCGA ATACACTCTT GATTTATGGG GAAAAATCAG GGAAGTTCAC GAGGCTACCG GTTCTGTTTT CAGATACGAA TATGACTATG CAGGAAATCT CACAGCAGTA ATTGACGGAA ACGGAAATGT GACGCGGTAC AATTACAACA GTCTCAATAT TCTTTCAGAG ATTATTGATC CACTGGGCGG CAGGATCTCT TATAAATATG ACAGGCAGGG AAGGATGGTT TGGACTTCTG ACAGGGAAGG AAGGGTAACA CATTACAGAT ACAACTTTGA CGATAAACTT GTGAGCATTT GGAGTGAAAA CGGAATATTT GAAAAATATG AATATAATCT TGACGGAAGT CTTGCAGCTT CGATAAGTGA CAGGACGATT CATTCATATA CCTATACTCC GTCGGGAAGA CTCAAGAAAA AGAACACAAA TGGTGTTACG GTTTTGGATT GCGAGTATGA CAAAAGCGGA CGGGTGACAA AACTTACCGA TGTAAGCGGA AAAACGATAG AGTATACTTA TGATATTCTT GGCAGACTCA CGAATGTAAT AAATGAAGGA AGAAAAACAG CTCAATACGA ATATAATCCG GACAACACCA TATCCAGGAT TCTATATGGC AGCGGCGTTT TTGCAAAATA TGACTATGAC ATGGATAAAA AGATAATTGG AATTTTGAAT GTTGATCCTT TTGGCCGGGA ACTCTTTAAC GGAAAGTATT TCTATGATAC CAACGGTAGC CAAATAAAGA AGGAAGAAAA CGGCAGGGTA ACGTTATACG GCTATGACAG CGTAAACAGA CTTGAGAAAG TTTCTTATCC TGAAGGAATC GATGAGAAAT TTGCTTATGA TAATGCAGGA AACAGGATTG CCCGGGAATT TGGCAAGTTA CTCGAGAATT ATAAATATGA CAAAAGCAAC AGATTGATAC AGAAAGTGTC CAATGGCATA GTGACAGATT ATGAATATGA TGCCGGGGGC AACCTTGTAA AGGAAATTGA AGGAGAAAGT GTAAGAAGAT TTGAGTATGA TGACTTTAAA AGGCTTGTAA AAGTAATAAA TCCTGACGGT ACATATATGG AAAACATATA TGATGCCGAG GGAATGAGAG TGCAGACGGT AGAAAACGGT GAATACCGAA GATTCATATT TGACGGAAAT AATGCAATAG CCGAGGTTGG AGAAGATTGG AGTTTAAAGG GCAGGAACGT TAGAGGACAT GCATTGTTGG AATTGGAAGA CGAAAACAAT AACACATATC ACTATTTGCA CAATGCACAT GGCGACGTGG CCAATCTTGC GGACAGTTCG GGAAAGATTG TAAACACTTA TGATTATGAT GCTTTCGGTA ACACTTTAAG CGTAAAGGAA ACAATACATA ACAGATTTCG CTATGCGGGA GAACAGTATG ATGATTTTAC AGGTCAGTAT TATTTGAGGG CAAGGTTCTA TAGTCCGTCT TTAGGGCGGT TCACCCAGGA AGATACATGG AGAGGTTTCA CATATAATCC TGCAAGTTTG AACCTATACA CATATGTTGA AAACAATCCT GTAATGTTTG TCGACCCTAC AGGACATTGG CCCAAATTTA TTGACAATGC GTTGGATTGG GTCGGTAATA AAGTCAATGA AGCTGCCGAC TGGGTGGGAA ACAGAGTAAA TGATGTTGTT GACTGGGCAG GCGACAGAAT CAATGATGCA CGCAATTTCA TAACAAATAC GGCAACAGGC GTTAAAAACT GGTGGGTTGA AAACAATGTC GGAGCATATG TTGTCGGAGG TTTGAAAATA TTAGGCGGGA TTGCCATAGG TGCGGGAGCA ATTGCAGTTA CTGTTTTAAC CTGTGGAGCA AGTGCACCGC TGACCGGTGC ATTGATTGGA GCGGGTATAA GTTTGGCGGC AACTTATGCA ACGGATGTTG TCGGCAACTT TATGAAAAAT GACTGGAAAT GGTCCGCATC CAATTTCCTG CCGTCATCCT CACCGGGTGA ATATATAGCA AATATGGTCT CGGGTGCTGT TGGTGGAATG CTGCCGGGAG GTTTTGGCTT CTGGAAGAGT TTGGGGCTAT TGGGTGTGGA TGCTGTAATG GCTGCAGGTA TATCCACGTT GATAGATTCA GGAATTGAAG ATATACGTAC AGGAAAGATA AATGAAATTG ATATAGAAAA AGGCTTCGAG GACGCAATGC TGAATTTTGC CGGAAATATT GTCGGAGCTG TAGCGGCAGG ACTTGTTCCG GATGTTTATG TACCTGCGTC AAGACAAAAA GCAAGGTATA TATTGAGAAA AGCAAATCAA GCATACAATG GTCCTGCTGC CAGAACTTTT ATTAAGAAAA TGCAATTCCG GAAACAAATA TTTGACTTAT TGGGAAATAT TATTGATGAG CTGGTAGGTG GAATAACATC GGAAATAATT GAAGGACACG GTAAGTGTAC TGCAAATTAG
|
Protein sequence | MRIKVLPEKM TEIASSLKRL SDEFDGIIRD INSIVKSIDW ELRSKEGVDQ KASDAIRVAK KISGSLESMA KDLEFARDRM IEEDKKASNI AGKMKSAVIG ASVATASAGN AKVADVNYTS DYKNLGPGRA NCPNTFVGDP VNVTTGNFYV TKKDIQIPTR GIPLEIRRYY NSIDQTCGIL GKGWRIGYET GLMRVEDSQD ILTVYPDGSI GVFEFDEKDN KYIAPRGIFD ILQKNEDESF TLKLHDGTTY RYDKSGNLTS ISDLNGNAIS IKYNGQGMIS SVISPGGKFL DFSYEDLKLR KITDHTGREI IYSYDKSGNL IQVKYPDGGI IKYGYDNKGM ISITDQNGNT YVQNTYDEAG RVVKQLDHEG NELVIEYCPG ECKNIFKWQK SGITRIYKYN EEKLLTEIIY DDGTKEVYTY DEDKNRNSIT DRNGRTRRFK YDERGNLIAE IMPEPFNYTV RYSYDGNNRR TKISTPAGGL ARFEYDEKGN LLKRIVKINS NTNSETVYTY DEFGRVLTIT DAENNTTSFE YNDDDINKPS AIIDPEGNRF TYDFDAIGRV VAITTGYGTV KIEYNERDKI TGLIDAENNK IRIKYDAVGN MVEVIAPEQY AQKGDKAQSY TFAYDAMDRM IKQIDPLGNV FMVKYDEHGN KIKEVNPNYY NEEEDDGIGM VYKYDSSHRQ INTIFPDGSK SRIKYDPQGN IIKTILPGDY NEETDDGPGM QFTYDEMDRL DKIIDPNGNV IAKYLYDEDG RVIKEINAKG YSSADNDEER WGTLYKYNLA GWLVEKRTPL ESINGQIFYN VTENVYDRNG RLVQQKISPE YVTKTGYPKT WNIISYKYDK NGRIIEVSDS IGSCVQYEYD CLGKKTLEKK KINDNTYKIT RFEYSSAGRL KKVIEEYDGK DLAGEEKGTV KAETLFEYDR NGNITAVISP EGYRKRFIYD AANRRIGVEE YLPPEGVKIS GNAVNALLKT IVRKTLYSYD KAGNLVQQRL PNGRTVETEY DEMNRRIRIK DAEGNITRLF YDASGNLIKY VEPENYDPKI DDGPGTSYFY DSMNRLLQVT NAAGIVVERN IYNTAGEIIK KIDARGYLAA ANDNDRYGVE YGYDPGGRLR YITTPEAKAK GIVSQQYNYN SLGYITEIID GKGNKTEYTL DLWGKIREVH EATGSVFRYE YDYAGNLTAV IDGNGNVTRY NYNSLNILSE IIDPLGGRIS YKYDRQGRMV WTSDREGRVT HYRYNFDDKL VSIWSENGIF EKYEYNLDGS LAASISDRTI HSYTYTPSGR LKKKNTNGVT VLDCEYDKSG RVTKLTDVSG KTIEYTYDIL GRLTNVINEG RKTAQYEYNP DNTISRILYG SGVFAKYDYD MDKKIIGILN VDPFGRELFN GKYFYDTNGS QIKKEENGRV TLYGYDSVNR LEKVSYPEGI DEKFAYDNAG NRIAREFGKL LENYKYDKSN RLIQKVSNGI VTDYEYDAGG NLVKEIEGES VRRFEYDDFK RLVKVINPDG TYMENIYDAE GMRVQTVENG EYRRFIFDGN NAIAEVGEDW SLKGRNVRGH ALLELEDENN NTYHYLHNAH GDVANLADSS GKIVNTYDYD AFGNTLSVKE TIHNRFRYAG EQYDDFTGQY YLRARFYSPS LGRFTQEDTW RGFTYNPASL NLYTYVENNP VMFVDPTGHW PKFIDNALDW VGNKVNEAAD WVGNRVNDVV DWAGDRINDA RNFITNTATG VKNWWVENNV GAYVVGGLKI LGGIAIGAGA IAVTVLTCGA SAPLTGALIG AGISLAATYA TDVVGNFMKN DWKWSASNFL PSSSPGEYIA NMVSGAVGGM LPGGFGFWKS LGLLGVDAVM AAGISTLIDS GIEDIRTGKI NEIDIEKGFE DAMLNFAGNI VGAVAAGLVP DVYVPASRQK ARYILRKANQ AYNGPAARTF IKKMQFRKQI FDLLGNIIDE LVGGITSEII EGHGKCTAN
|
| |