Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3232 |
Symbol | |
ID | 4810272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3832532 |
End bp | 3837541 |
Gene Length | 5010 bp |
Protein Length | 1669 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 640108666 |
Product | YD repeat-containing protein |
Protein accession | YP_001039620 |
Protein GI | 125975710 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATAA AAATGATACC TGAAAAAATG ACAAAGATAG CAGATAATAT GAAGAGAATA TCGGAAAAAT TTGATGACAT AGTACAGGAT GTAAAAGGGG TAATATACTC CATTGATTGG GAGCTCAGAA GCAAGGAAGG AATAGATCAG AAGTTGTTGA TTGCAGATCG GACGGCAAAA AATATAGCCA ATGAATTGAG TAAAATGTCC CAGAACCTTA TAGAGGCAAG AGATCGAATG ATAGAGGCTG ACAATAAAGC ATCGGCTGCA TCCAGAAAGA TGAAAATAGC GGATTTTATA AAGTTGGTTA CTACAGTACT GTTGCCTGAT CCGTTACCTG GGTTGGCCAA TTATTTATTG TGGAATCGAC TGATAGGTCG CGGAACTGCA AATTGCCCGA ACATTTTTGC GGGGGACCCT GTAAATGTGG TATCGGGAAA TTTTTATTTA ACAAGAAGAG ATATAACCAT ACCGTCAAGA GGTATGGCGA TTGAGATAAC CAGATATTAC AACTCCATGG ATAATACGGA AGGAATATTC GGTAAAGGCT GGAGAACAGA TTATGAAACA TGCCTGAAGA AAAAGGAAGA CAGTGAAGAC ATAATAGTTG TGTATCCGGA AGGGAACATA AGAGTATTTG AATATACCGA TACAGGGAGT TTCAAGTCTC CGAAGGGAGT ATATGACACT CTTTTAAAAA CAGAGGACGG TACATACATA TTAAAAGTTC AAAAAGGAAT TACCTACAAA TATGACCAGG CAGGAAGCCT TGTATCAATC TTGGATTCAA ACAATAACGA GATAAGATTT AAATATAACT GGGAGGGATT GCTGTCTTCA GTAATGTCAC CGGGAGGAAA ACTTTTGATG TTTTCCTATG AAGGTGGCAG GGTTGTCAGT ATAACCGACC ATACGGGAAG GAATTTGAAA TATAAATATG ATGAAAAAGG AAATCTGACA CAGGTAGTAT ACCCTGACGG AGGGAAGATT ACCTATGCAT ATGATAACAT AGGACTGATT TCAATAACCG ACCAGAACGG CAACACCTAT GTCCAAAACA CCTATGATGA AAAGGGCAGA GTAGTAAGGC AGCTTGACCA TGAGAACAAT GAATTGATTA TAGAATATGA CGAAGAAAAT CGTGAAAATA CTTTCAAATG GACGAAAAGC GGTATAACCC GTGTATATAA ATACAATGAG GAGTTGCTTC TGACCGAAGT AAGGTATGAT GACGGAAGCG TACAGAAATA CACGTATGAT GAAAACCTTA ACAGAAACAG TGAGACGGAC AGAAATGGCA ACACGACGTA CAGGAAATAT AATGACAAGG GCAATTTAAT AGAAGTAATT TCACCGGAGC CTTTCTGCTA TAAGACAAAA TACAGTTATG ACGAGGAATG CAGACTGATA AAGATAATGT CGCCGGGCGG GGGAGAAGTG TCTTTTGAAT ATGACGAAAG GGGAAACCTT TTAAAACGCA TTGTAAAGAC CGGAAGCAGA AGTTATTCGG AGTGGTCATA TACGTATGAT CAATATGGAA GAATGACAGC ATCAAAAGAC GCGGAAAACA ACACGAAGAC TTTTGAGTAT GGAGAAGAAG ATGTAAACAA ACCGACATTG ATAAAAGATG CGGTAGGGAA CATATTTAAA TATGAATTTG ACAAAGTGGG GCGGGTAGTG GCCACAACCA CAGATTACGG AACAGTAAGG TTAAAATATG ATGAGTGTGA CCGGATAACC CACATAACCG ATACAGAAGG GAACACAACA AGAATCTGCT ATGACAAAGC AGGAAACATG ACAAAGGTTA TAGCGCCGAA GCAGTATGGG GAGAAAGGCG AAAACGGGTC AGGATATGCA TTTGAATATA ATGCAATGGA CAAGCTTATA AGGACAATTG ACCCGTTGGG CAATGTTTTT GCGGTAAAAT ACGATGAGAA CGGCAACAAG ATAAAAGAGA TCAACCCGAA CTACTATAGT TCTGAGAAAG ATGACGGTAT AGGAATAGAA TACAAATATG ACACCAACCA CCGCAGGATA AACACAATAT TCCCGGACGG AAGCATGTCA AGGATAAAGT ATGACGCGGA AGGCAATATA ATAAAGACGA TATCCTGGAA GGATTATAAC AAGGATTTGG ATGACGGGCC GGGGATGGAG TATACCTATG ATGAAATGAA CAGGCTTACG CAAATAATAG ACCCGGAAGG GAATGTAATA AAGAAATACA TATACGATGA AGACGGAAGA ATCGTGAAAG AAATAGACGC AAAAGGATAT AGCAGTGCAG ATAACGATGA AGAACGTTGG GGTACAATAT ATAAATACAA CCTTGCCGGA TGGCTTGTTG AGAAGAGGAC ACCGTTACAG CAGAAAAATG GTGAAATATA TTACAACATA ATAGAATATG TGTATGACAG AAATGGAAGG GTAGTACAGG AGAAAAGATC TCCGGAATAT GTGACCAAAA CAGAATATCC AAAGAAATGG AATATAATAA ACTATAAATA TGATCCCAAC GGAAATCTGG CAGAGGTAAC CGACAGCCTT GGGGCGGTGA TAACCTATGA ATATGACTGC TTTGGAAAAA GGACATTGGA GAAAATCAAG ATAAACGACA GGAAAGAGAG AGCAACGAGG TATAAATATA ATGGTATTGG AAAACTGGTA AGAGTAATTA GGGAGCTGGA CGGAGAAGAC CTTTCAGGCT ATGGTGAAGA GAAGGTTTTG ACGGAGACAA TATACAATTA CGATCCAAAC GGTAATCTTA TAGAGGTGAT TTCTCCTGAA GGGTATTTGA CTGTATTTAA GTATGACGAT GCGAACAGAA GGATAAAGAG TATATTGTAT CAACCGCAGA ACGGTGTGAA ACTGAGCGGC AGTGCGTATT GTGCCCTTTT AAATACAAAG TCGAGGAGCA TAAGTTATGA GTATGACCGT GCGGGGAACC TTGTAAAACA GATATTGCCT AACGGTGGCA TTATAATAAA CGAATATGAT GAAATGAACA GAAGAATAAG AGTTACCGAC CCTGACGGAA ATACCAGAAG GATTTTCTAT GACAATTCAG GAAATATCGT AAAATATGTA AATCCGGAGA ATTACGATCC GGAGAAAGAT GACGGAACAG GTACCACATA TCTTTATGAC TCAATGAACC GTCTTATAGA AATAGTAAAT GCAGCGGGTA TAGTAGTGGA AAGGAATATA TACAACACAG CGGGAGAGAT AATAAAGAGA ATAGACTCAG TTGGTTATAG TTCCGCAGAT AATGACAATG ACAGGCATGG AGTTGAATTT AGTTATGACC TGGCGGGACG TTTGGTGGAG ATAACAACGC CGGAAGCAAA GATTCATGGC CGAAAGAGTC AGCAATACAC TTATGACGCA GAAGGAAACA TAACAGGAGT AGTTGACGGA AACGGAAACA GCACAAGGTA CAGTTTGGAC CTGTGGGGTA AGGTAATAAA CATAACGGAA CCCGACGGAA CCAATATAAA ATACGATTAT GACTATGCGG GAAATCTTGT ATCCACCACT GACGGTAACG GAAACACCAC CCGTTATACA TACAACAGCT TGAACCTTCT GTCGGAGATA ATAGATCCTG ACGGAAGGAA AATAACCTTC AAGTATGACA GACAGGGAAG AATGGTGCAG AGGATAGGGA AAGACGGACG CAGCACATAT TATAATTACA ATGCGGATAA CAATATAACC GGGCGTTGGG AAGAAGAAGG GCAGATGGAA AAATACGAGT ATAATGTAGA CGGAAGCCTG GCTGCGTCAA TAAGCGGTAC TACTATACAT ACTTATGCCT ATACCTTGGC AGGAAGGCTG AAAAGTAAGA CAACCAACGG ACAGAAGGTA TTGGAGTATG ATTACAATAA AAATGGGCTT ATATCAAGGC TCACCGATAT AAGTGGAACA CCGGTGGAGT ATACATATGA CGTATTGGGG AGATTAACAA CGGTAACAAA CGGAGGCAAA GTTTCTGCGA GGTATGAATA TAATATTGAC AATACAATAG CACAGGTATT GTACGGAAGC GGAGTATGTG CGAGGTATGA ATACAACTTG GATAAGATGA TAACAAGGCT TTTAAATATA GATCCGACAG GAAAAGAAAT GTTTGCATAC AGGTATGCCT ATGACGGAAA CGGCAACCAG ATTTTGAAAG AAGAGAACGA TAAAGTAACG GCCTACAGTT ATGATGCGCT GAACCGTTTG AAGGAAGTGG CATACCCTGG AAGTATAAAA GAGAGATTTA TATATGATGC GAACGGTAAC AGGCTTAAGA GAGAATATGG AGACATATTT GAGCAATATG AATATGATAG TTGTAATAGA TTGATTCAAA GAATAAAAAA CGGACTGTTA ACGGAATATG AGTATGATGC GAGGGGAAAT TTGATAAAAG AAAAAGAGGG TGAGTTGACT AAATTATACA GCTATGACGG ATTTGACAGA CTGATACGTG TACAAAATCC GGACGGAACA TATATGGAAA ATATATACGA TGCCGAGAAT TTGAGAACGG TCTCGATAGA AAACGGTAGG TACAACAGGT ATGTGTACAA CGGAAGAAAT ATAGCGTGTG AAGTAGACGA GGATTGGAGT CTAAAAGACA GAATAGTCTT TGGGCATACG ATATTACAAA GAGAAGACAG TGACAAGAAT GAGTATTATT ATATTCACAA TGCCCATGGG GATATTACAG CTCTTACCGA TGGGAAAGGA GAAGTAATAA ACAGCTACAG TTACGATGCT TTTGGAAATA TATTGGACAG TGTTGAGAAG ATAGAGAACA GATTCAAATA TTCGGGAGAA ATGCTTGATC CTGTTACGGG ACAATATTAC CTGAGAGCGA GATATTATAA CCCAAGCATA GGAAGGTTTA TGCAGGAAGA TACCTTCAGA GGAGACGGAC TCAATTTATA TACTTATGTT GCCAACAACC CGGTATTATG TTTTTTACGA GGTACAAACA AAAAACATTG TAAACAGTAA
|
Protein sequence | MQIKMIPEKM TKIADNMKRI SEKFDDIVQD VKGVIYSIDW ELRSKEGIDQ KLLIADRTAK NIANELSKMS QNLIEARDRM IEADNKASAA SRKMKIADFI KLVTTVLLPD PLPGLANYLL WNRLIGRGTA NCPNIFAGDP VNVVSGNFYL TRRDITIPSR GMAIEITRYY NSMDNTEGIF GKGWRTDYET CLKKKEDSED IIVVYPEGNI RVFEYTDTGS FKSPKGVYDT LLKTEDGTYI LKVQKGITYK YDQAGSLVSI LDSNNNEIRF KYNWEGLLSS VMSPGGKLLM FSYEGGRVVS ITDHTGRNLK YKYDEKGNLT QVVYPDGGKI TYAYDNIGLI SITDQNGNTY VQNTYDEKGR VVRQLDHENN ELIIEYDEEN RENTFKWTKS GITRVYKYNE ELLLTEVRYD DGSVQKYTYD ENLNRNSETD RNGNTTYRKY NDKGNLIEVI SPEPFCYKTK YSYDEECRLI KIMSPGGGEV SFEYDERGNL LKRIVKTGSR SYSEWSYTYD QYGRMTASKD AENNTKTFEY GEEDVNKPTL IKDAVGNIFK YEFDKVGRVV ATTTDYGTVR LKYDECDRIT HITDTEGNTT RICYDKAGNM TKVIAPKQYG EKGENGSGYA FEYNAMDKLI RTIDPLGNVF AVKYDENGNK IKEINPNYYS SEKDDGIGIE YKYDTNHRRI NTIFPDGSMS RIKYDAEGNI IKTISWKDYN KDLDDGPGME YTYDEMNRLT QIIDPEGNVI KKYIYDEDGR IVKEIDAKGY SSADNDEERW GTIYKYNLAG WLVEKRTPLQ QKNGEIYYNI IEYVYDRNGR VVQEKRSPEY VTKTEYPKKW NIINYKYDPN GNLAEVTDSL GAVITYEYDC FGKRTLEKIK INDRKERATR YKYNGIGKLV RVIRELDGED LSGYGEEKVL TETIYNYDPN GNLIEVISPE GYLTVFKYDD ANRRIKSILY QPQNGVKLSG SAYCALLNTK SRSISYEYDR AGNLVKQILP NGGIIINEYD EMNRRIRVTD PDGNTRRIFY DNSGNIVKYV NPENYDPEKD DGTGTTYLYD SMNRLIEIVN AAGIVVERNI YNTAGEIIKR IDSVGYSSAD NDNDRHGVEF SYDLAGRLVE ITTPEAKIHG RKSQQYTYDA EGNITGVVDG NGNSTRYSLD LWGKVINITE PDGTNIKYDY DYAGNLVSTT DGNGNTTRYT YNSLNLLSEI IDPDGRKITF KYDRQGRMVQ RIGKDGRSTY YNYNADNNIT GRWEEEGQME KYEYNVDGSL AASISGTTIH TYAYTLAGRL KSKTTNGQKV LEYDYNKNGL ISRLTDISGT PVEYTYDVLG RLTTVTNGGK VSARYEYNID NTIAQVLYGS GVCARYEYNL DKMITRLLNI DPTGKEMFAY RYAYDGNGNQ ILKEENDKVT AYSYDALNRL KEVAYPGSIK ERFIYDANGN RLKREYGDIF EQYEYDSCNR LIQRIKNGLL TEYEYDARGN LIKEKEGELT KLYSYDGFDR LIRVQNPDGT YMENIYDAEN LRTVSIENGR YNRYVYNGRN IACEVDEDWS LKDRIVFGHT ILQREDSDKN EYYYIHNAHG DITALTDGKG EVINSYSYDA FGNILDSVEK IENRFKYSGE MLDPVTGQYY LRARYYNPSI GRFMQEDTFR GDGLNLYTYV ANNPVLCFLR GTNKKHCKQ
|
| |