Gene Cthe_3232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3232 
Symbol 
ID4810272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3832532 
End bp3837541 
Gene Length5010 bp 
Protein Length1669 aa 
Translation table11 
GC content39% 
IMG OID640108666 
ProductYD repeat-containing protein 
Protein accessionYP_001039620 
Protein GI125975710 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGATAA AAATGATACC TGAAAAAATG ACAAAGATAG CAGATAATAT GAAGAGAATA 
TCGGAAAAAT TTGATGACAT AGTACAGGAT GTAAAAGGGG TAATATACTC CATTGATTGG
GAGCTCAGAA GCAAGGAAGG AATAGATCAG AAGTTGTTGA TTGCAGATCG GACGGCAAAA
AATATAGCCA ATGAATTGAG TAAAATGTCC CAGAACCTTA TAGAGGCAAG AGATCGAATG
ATAGAGGCTG ACAATAAAGC ATCGGCTGCA TCCAGAAAGA TGAAAATAGC GGATTTTATA
AAGTTGGTTA CTACAGTACT GTTGCCTGAT CCGTTACCTG GGTTGGCCAA TTATTTATTG
TGGAATCGAC TGATAGGTCG CGGAACTGCA AATTGCCCGA ACATTTTTGC GGGGGACCCT
GTAAATGTGG TATCGGGAAA TTTTTATTTA ACAAGAAGAG ATATAACCAT ACCGTCAAGA
GGTATGGCGA TTGAGATAAC CAGATATTAC AACTCCATGG ATAATACGGA AGGAATATTC
GGTAAAGGCT GGAGAACAGA TTATGAAACA TGCCTGAAGA AAAAGGAAGA CAGTGAAGAC
ATAATAGTTG TGTATCCGGA AGGGAACATA AGAGTATTTG AATATACCGA TACAGGGAGT
TTCAAGTCTC CGAAGGGAGT ATATGACACT CTTTTAAAAA CAGAGGACGG TACATACATA
TTAAAAGTTC AAAAAGGAAT TACCTACAAA TATGACCAGG CAGGAAGCCT TGTATCAATC
TTGGATTCAA ACAATAACGA GATAAGATTT AAATATAACT GGGAGGGATT GCTGTCTTCA
GTAATGTCAC CGGGAGGAAA ACTTTTGATG TTTTCCTATG AAGGTGGCAG GGTTGTCAGT
ATAACCGACC ATACGGGAAG GAATTTGAAA TATAAATATG ATGAAAAAGG AAATCTGACA
CAGGTAGTAT ACCCTGACGG AGGGAAGATT ACCTATGCAT ATGATAACAT AGGACTGATT
TCAATAACCG ACCAGAACGG CAACACCTAT GTCCAAAACA CCTATGATGA AAAGGGCAGA
GTAGTAAGGC AGCTTGACCA TGAGAACAAT GAATTGATTA TAGAATATGA CGAAGAAAAT
CGTGAAAATA CTTTCAAATG GACGAAAAGC GGTATAACCC GTGTATATAA ATACAATGAG
GAGTTGCTTC TGACCGAAGT AAGGTATGAT GACGGAAGCG TACAGAAATA CACGTATGAT
GAAAACCTTA ACAGAAACAG TGAGACGGAC AGAAATGGCA ACACGACGTA CAGGAAATAT
AATGACAAGG GCAATTTAAT AGAAGTAATT TCACCGGAGC CTTTCTGCTA TAAGACAAAA
TACAGTTATG ACGAGGAATG CAGACTGATA AAGATAATGT CGCCGGGCGG GGGAGAAGTG
TCTTTTGAAT ATGACGAAAG GGGAAACCTT TTAAAACGCA TTGTAAAGAC CGGAAGCAGA
AGTTATTCGG AGTGGTCATA TACGTATGAT CAATATGGAA GAATGACAGC ATCAAAAGAC
GCGGAAAACA ACACGAAGAC TTTTGAGTAT GGAGAAGAAG ATGTAAACAA ACCGACATTG
ATAAAAGATG CGGTAGGGAA CATATTTAAA TATGAATTTG ACAAAGTGGG GCGGGTAGTG
GCCACAACCA CAGATTACGG AACAGTAAGG TTAAAATATG ATGAGTGTGA CCGGATAACC
CACATAACCG ATACAGAAGG GAACACAACA AGAATCTGCT ATGACAAAGC AGGAAACATG
ACAAAGGTTA TAGCGCCGAA GCAGTATGGG GAGAAAGGCG AAAACGGGTC AGGATATGCA
TTTGAATATA ATGCAATGGA CAAGCTTATA AGGACAATTG ACCCGTTGGG CAATGTTTTT
GCGGTAAAAT ACGATGAGAA CGGCAACAAG ATAAAAGAGA TCAACCCGAA CTACTATAGT
TCTGAGAAAG ATGACGGTAT AGGAATAGAA TACAAATATG ACACCAACCA CCGCAGGATA
AACACAATAT TCCCGGACGG AAGCATGTCA AGGATAAAGT ATGACGCGGA AGGCAATATA
ATAAAGACGA TATCCTGGAA GGATTATAAC AAGGATTTGG ATGACGGGCC GGGGATGGAG
TATACCTATG ATGAAATGAA CAGGCTTACG CAAATAATAG ACCCGGAAGG GAATGTAATA
AAGAAATACA TATACGATGA AGACGGAAGA ATCGTGAAAG AAATAGACGC AAAAGGATAT
AGCAGTGCAG ATAACGATGA AGAACGTTGG GGTACAATAT ATAAATACAA CCTTGCCGGA
TGGCTTGTTG AGAAGAGGAC ACCGTTACAG CAGAAAAATG GTGAAATATA TTACAACATA
ATAGAATATG TGTATGACAG AAATGGAAGG GTAGTACAGG AGAAAAGATC TCCGGAATAT
GTGACCAAAA CAGAATATCC AAAGAAATGG AATATAATAA ACTATAAATA TGATCCCAAC
GGAAATCTGG CAGAGGTAAC CGACAGCCTT GGGGCGGTGA TAACCTATGA ATATGACTGC
TTTGGAAAAA GGACATTGGA GAAAATCAAG ATAAACGACA GGAAAGAGAG AGCAACGAGG
TATAAATATA ATGGTATTGG AAAACTGGTA AGAGTAATTA GGGAGCTGGA CGGAGAAGAC
CTTTCAGGCT ATGGTGAAGA GAAGGTTTTG ACGGAGACAA TATACAATTA CGATCCAAAC
GGTAATCTTA TAGAGGTGAT TTCTCCTGAA GGGTATTTGA CTGTATTTAA GTATGACGAT
GCGAACAGAA GGATAAAGAG TATATTGTAT CAACCGCAGA ACGGTGTGAA ACTGAGCGGC
AGTGCGTATT GTGCCCTTTT AAATACAAAG TCGAGGAGCA TAAGTTATGA GTATGACCGT
GCGGGGAACC TTGTAAAACA GATATTGCCT AACGGTGGCA TTATAATAAA CGAATATGAT
GAAATGAACA GAAGAATAAG AGTTACCGAC CCTGACGGAA ATACCAGAAG GATTTTCTAT
GACAATTCAG GAAATATCGT AAAATATGTA AATCCGGAGA ATTACGATCC GGAGAAAGAT
GACGGAACAG GTACCACATA TCTTTATGAC TCAATGAACC GTCTTATAGA AATAGTAAAT
GCAGCGGGTA TAGTAGTGGA AAGGAATATA TACAACACAG CGGGAGAGAT AATAAAGAGA
ATAGACTCAG TTGGTTATAG TTCCGCAGAT AATGACAATG ACAGGCATGG AGTTGAATTT
AGTTATGACC TGGCGGGACG TTTGGTGGAG ATAACAACGC CGGAAGCAAA GATTCATGGC
CGAAAGAGTC AGCAATACAC TTATGACGCA GAAGGAAACA TAACAGGAGT AGTTGACGGA
AACGGAAACA GCACAAGGTA CAGTTTGGAC CTGTGGGGTA AGGTAATAAA CATAACGGAA
CCCGACGGAA CCAATATAAA ATACGATTAT GACTATGCGG GAAATCTTGT ATCCACCACT
GACGGTAACG GAAACACCAC CCGTTATACA TACAACAGCT TGAACCTTCT GTCGGAGATA
ATAGATCCTG ACGGAAGGAA AATAACCTTC AAGTATGACA GACAGGGAAG AATGGTGCAG
AGGATAGGGA AAGACGGACG CAGCACATAT TATAATTACA ATGCGGATAA CAATATAACC
GGGCGTTGGG AAGAAGAAGG GCAGATGGAA AAATACGAGT ATAATGTAGA CGGAAGCCTG
GCTGCGTCAA TAAGCGGTAC TACTATACAT ACTTATGCCT ATACCTTGGC AGGAAGGCTG
AAAAGTAAGA CAACCAACGG ACAGAAGGTA TTGGAGTATG ATTACAATAA AAATGGGCTT
ATATCAAGGC TCACCGATAT AAGTGGAACA CCGGTGGAGT ATACATATGA CGTATTGGGG
AGATTAACAA CGGTAACAAA CGGAGGCAAA GTTTCTGCGA GGTATGAATA TAATATTGAC
AATACAATAG CACAGGTATT GTACGGAAGC GGAGTATGTG CGAGGTATGA ATACAACTTG
GATAAGATGA TAACAAGGCT TTTAAATATA GATCCGACAG GAAAAGAAAT GTTTGCATAC
AGGTATGCCT ATGACGGAAA CGGCAACCAG ATTTTGAAAG AAGAGAACGA TAAAGTAACG
GCCTACAGTT ATGATGCGCT GAACCGTTTG AAGGAAGTGG CATACCCTGG AAGTATAAAA
GAGAGATTTA TATATGATGC GAACGGTAAC AGGCTTAAGA GAGAATATGG AGACATATTT
GAGCAATATG AATATGATAG TTGTAATAGA TTGATTCAAA GAATAAAAAA CGGACTGTTA
ACGGAATATG AGTATGATGC GAGGGGAAAT TTGATAAAAG AAAAAGAGGG TGAGTTGACT
AAATTATACA GCTATGACGG ATTTGACAGA CTGATACGTG TACAAAATCC GGACGGAACA
TATATGGAAA ATATATACGA TGCCGAGAAT TTGAGAACGG TCTCGATAGA AAACGGTAGG
TACAACAGGT ATGTGTACAA CGGAAGAAAT ATAGCGTGTG AAGTAGACGA GGATTGGAGT
CTAAAAGACA GAATAGTCTT TGGGCATACG ATATTACAAA GAGAAGACAG TGACAAGAAT
GAGTATTATT ATATTCACAA TGCCCATGGG GATATTACAG CTCTTACCGA TGGGAAAGGA
GAAGTAATAA ACAGCTACAG TTACGATGCT TTTGGAAATA TATTGGACAG TGTTGAGAAG
ATAGAGAACA GATTCAAATA TTCGGGAGAA ATGCTTGATC CTGTTACGGG ACAATATTAC
CTGAGAGCGA GATATTATAA CCCAAGCATA GGAAGGTTTA TGCAGGAAGA TACCTTCAGA
GGAGACGGAC TCAATTTATA TACTTATGTT GCCAACAACC CGGTATTATG TTTTTTACGA
GGTACAAACA AAAAACATTG TAAACAGTAA
 
Protein sequence
MQIKMIPEKM TKIADNMKRI SEKFDDIVQD VKGVIYSIDW ELRSKEGIDQ KLLIADRTAK 
NIANELSKMS QNLIEARDRM IEADNKASAA SRKMKIADFI KLVTTVLLPD PLPGLANYLL
WNRLIGRGTA NCPNIFAGDP VNVVSGNFYL TRRDITIPSR GMAIEITRYY NSMDNTEGIF
GKGWRTDYET CLKKKEDSED IIVVYPEGNI RVFEYTDTGS FKSPKGVYDT LLKTEDGTYI
LKVQKGITYK YDQAGSLVSI LDSNNNEIRF KYNWEGLLSS VMSPGGKLLM FSYEGGRVVS
ITDHTGRNLK YKYDEKGNLT QVVYPDGGKI TYAYDNIGLI SITDQNGNTY VQNTYDEKGR
VVRQLDHENN ELIIEYDEEN RENTFKWTKS GITRVYKYNE ELLLTEVRYD DGSVQKYTYD
ENLNRNSETD RNGNTTYRKY NDKGNLIEVI SPEPFCYKTK YSYDEECRLI KIMSPGGGEV
SFEYDERGNL LKRIVKTGSR SYSEWSYTYD QYGRMTASKD AENNTKTFEY GEEDVNKPTL
IKDAVGNIFK YEFDKVGRVV ATTTDYGTVR LKYDECDRIT HITDTEGNTT RICYDKAGNM
TKVIAPKQYG EKGENGSGYA FEYNAMDKLI RTIDPLGNVF AVKYDENGNK IKEINPNYYS
SEKDDGIGIE YKYDTNHRRI NTIFPDGSMS RIKYDAEGNI IKTISWKDYN KDLDDGPGME
YTYDEMNRLT QIIDPEGNVI KKYIYDEDGR IVKEIDAKGY SSADNDEERW GTIYKYNLAG
WLVEKRTPLQ QKNGEIYYNI IEYVYDRNGR VVQEKRSPEY VTKTEYPKKW NIINYKYDPN
GNLAEVTDSL GAVITYEYDC FGKRTLEKIK INDRKERATR YKYNGIGKLV RVIRELDGED
LSGYGEEKVL TETIYNYDPN GNLIEVISPE GYLTVFKYDD ANRRIKSILY QPQNGVKLSG
SAYCALLNTK SRSISYEYDR AGNLVKQILP NGGIIINEYD EMNRRIRVTD PDGNTRRIFY
DNSGNIVKYV NPENYDPEKD DGTGTTYLYD SMNRLIEIVN AAGIVVERNI YNTAGEIIKR
IDSVGYSSAD NDNDRHGVEF SYDLAGRLVE ITTPEAKIHG RKSQQYTYDA EGNITGVVDG
NGNSTRYSLD LWGKVINITE PDGTNIKYDY DYAGNLVSTT DGNGNTTRYT YNSLNLLSEI
IDPDGRKITF KYDRQGRMVQ RIGKDGRSTY YNYNADNNIT GRWEEEGQME KYEYNVDGSL
AASISGTTIH TYAYTLAGRL KSKTTNGQKV LEYDYNKNGL ISRLTDISGT PVEYTYDVLG
RLTTVTNGGK VSARYEYNID NTIAQVLYGS GVCARYEYNL DKMITRLLNI DPTGKEMFAY
RYAYDGNGNQ ILKEENDKVT AYSYDALNRL KEVAYPGSIK ERFIYDANGN RLKREYGDIF
EQYEYDSCNR LIQRIKNGLL TEYEYDARGN LIKEKEGELT KLYSYDGFDR LIRVQNPDGT
YMENIYDAEN LRTVSIENGR YNRYVYNGRN IACEVDEDWS LKDRIVFGHT ILQREDSDKN
EYYYIHNAHG DITALTDGKG EVINSYSYDA FGNILDSVEK IENRFKYSGE MLDPVTGQYY
LRARYYNPSI GRFMQEDTFR GDGLNLYTYV ANNPVLCFLR GTNKKHCKQ