Gene Cagg_3576 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3576 
Symbol 
ID7269720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4342364 
End bp4348813 
Gene Length6450 bp 
Protein Length2149 aa 
Translation table11 
GC content52% 
IMG OID643568384 
ProductYD repeat protein 
Protein accessionYP_002464850 
Protein GI219850417 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.971607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00854554 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATAGAG TATGGCGTTG GATCAACCTA CTCACCGTTT TTGTCGTGCT GTCGGGATTA 
TTGCCAACGC AAATGCCGGT TGTTGCTAAC GAGTCGCTTT CCGCTGCTTC CTTTCGTCGT
GTATCTCCTC CTCAGCAAGC ACCGTCCAAT CCACCACAAC CAGCAGTGTC TTCATACCGA
ATTTTTGTGC CACTTTTGCA GTCACCGGGC TTGCTCTTGG ATACTGCTGA ACAGGTACTG
AATACCGATC AACCAACCAC TTTAACATTG CTTAATAATC AGATACAAGT CAATTTTGCA
CCTACTACCG ATGAGCGAAC GATTCAAGCT ACTTTGACCG CATACAAAGG AACACCGGTT
ACAACTCCAG GAAGAGGACA GGCTGGGCCA GCGCTACGTC TGGTATTAGC TGATCGATTG
CATCCCGGCA ACGAGGTACG GCTGCCACCG ACGATCACAC CGCTGCCACG TCAACCGTTT
TATCCCGCGG ATACGCAAGT TACGCCTGGT ATTGTGTTGG AATGGGTGTA CAGTGACGCT
GATATTTGGG GTGTAGATGA GCGTTCATTA GGTCTCTACA GGCGTCAGGC GGCTACCGAT
TCGTGGCAGC GCGTGCCAAG TGCAGTAATT CCCGACCAAA ATCGTTTGAT TGCACATCTT
GAGACTGGCG GTGAGTATGC GCTGTTGGGT GAATTGCAAG TAGTGCAGTT GAACCAGCGC
ACCATGCGTG TAGCACTCGA TCCTGATGAC AATGATGGTT TTGCATTGTG GCCACAGATC
GGTCGTGTCG AAGAGATGAC CTACAACTGG CGCCTGGTAA CAGCCGTTGA GCAGCGGTTT
CGGTCGTCTG GTTGTCCGGT CAATATTCTC ATTACTCGCG ATGCGACACC GTTCGTCAAC
GAATCGCTGC GAGCAGCGGC GATCAACGGA TTCGGTGCCG ATATTGCAGT GACATTGGCG
TTCAATAGTT TTATAGGCAC GCCATGGGGT GGTCTGGGTG ATGGTGGCCC TATCGCCTTT
GCGCGAATCA ATGCACCGGC CGATCGGTCG CTTGCACAGC GGTTGCTTGA TAGCATGCGT
GATTACACCG GTCGTCGCTC GACCCGTCCG GTGTTAAGTC CGCTGCCTCA TCCAGAGTTT
AACAGTCTCA CTATGCCGTA TGCGCATCTT GAAGTGCTGT TCCTCGATCA TATCTTCGAC
TGGCCGGTTA TCAACACTGC ATTCGATCAG ATAGTCAATG CCGTCTATGC CGCACTGGTA
TCGGAACTTA CCCAGTATGG TTTGCTTTGC ACGCCGCCTG GTGGATCTAA CCCGACCCCA
CCATCGCTGT CAGCACGTCC CTCGGCTGAG CTGTTACTCC GATTGCGCAA TCTGGGGTAT
CAAAACTTTC AGCGGTACGG CATGGATCCG GTTAGTTTCT CAACCGGGAA TCACATCTTA
GTCCAACCGC TGGTGCGTAT TCCCGGTCGT GGTGGTCTTG ATATTGACTT GACCCTCGTG
TACAACTCGC AAGATCCCCG CTGCGATATT TTGGGGTGTG GTTGGAGTTT TCCCTATAAC
ATACGGTTGC AGCGCTACAG CGACGAGTCG GTCGCGGTGG TTTACCCCGA CGGTCGCACG
ATGCTCTATG AATGGACCGG GAGTGAGTAT CGTCCTCCAG CGGGCGGATA CGACCGGCTT
GAGCGGCGTG AAGATTTTTG GTTCCTTACC AGCCGAGATG GTGAGCAAAC ATGGCAGTTT
CAAGAGACGG TCACCGGCCT TGGGATTCTG GTCGCCTGGC GTGACCGGCG CGGCAATGCG
TTGACCTTCA CGCACGATCT GAGCGGCCAA GATGCTTGGC GGCGTGGTGA AGCTGTGCCA
CGTCCACCAC TGACGGCAAT TACCGACGCA ACTGGGCGGG TGATTCGGGT GGAAAATGAT
GCTGCCGGGC GAATTAGGGC TTTTGTCTTG CCCGACGGCC GCCGGTTCGA TCTCGAGTTC
GACGATCGTG GTGATCTTGT TGCGATTACC GATGCCAATA CCCCGACTCG TGGCACTTAT
CGGTTTGAGT ACGATGAGCG CCACCGAATC GTCAAGCAGT GGGATCCTGA GGGCATCCTC
TATTTGGTGA ACGAGTACGA TGACCGTGAT CGAGTGGTGC GACAAGTTGA TGCTAGTGGC
TCGGTCAGCC TGGCGAGCTA CGACCCGATT GCGCGCACGA CGGTATTCAC CGACAATCTT
GGTTTTCGGT ACGTGTATGC CTACGATGAA TTGTATCGGG TTATTGCTGA AACTGACCCA
CTCAATCGCA CGTCGCGAAC AGTGTATGAC GATCAATACA ATGTCGTTGC CTATACCGAT
GCTAATGGCC GGACGGTGCG AGCCAGCTAT GACCATCGTG GGCGATTGAC CGCTCTGCAT
GAGCCGGTGT CTGAGGGTAG TCTTGCTTGT CGTGAGCATT CCTACTCGGT TGATACAAAG
GTTTGGGAAT ACAACGGTAG TGGGGCAAAT GCCGATCTTC CGACGGCATT CATCGATGAG
CTTGGCCGAC GGTGGGAGTA CCGTTACGAT GATGAAGGCA ATCTGGTGCA GATCATCGCA
CCGATTGGGG CGATGTCGTT CCGCTTTGAT GAATGGGGGC AGCGGGTTGC CTCGATTGAT
GCAGCAGGTC GTGTGACTCA ATATTTCTAT GACGCCCACG GCAATCTTGC CCGCATCATC
GATCCGAAAA ACGGTACTAC GTCGTTTACC CATGACATTA CCGGGCGTCT GTTGAGCATC
ACCGATGCTA ACAATCGTAC CGTTAGCTTT GCATACTATG GCAATGACCT CATAGCAAAG
GTTACCGACG CCAAAGGCAA CGAGCTGTCC TTCGGCTACA ATCCGAACGG TCTCCTGACA
AGCCTCACCG ACCGCAACGG TGTAACTCGT CGCTTTAGCT ATGACGTGAA TCTTAATCTG
ATCGGTGAGT TGGATCATCC GAATGGGGCA TGGTTGACGT ATGCCTACGA CCGGTTGCAG
CGCTTGATCC GCACCACCGA TCGTAATGGT CATAGTACTG AATATCGCTA CGATCCGGCC
GGTCAGCTAC GTGAGGTAAT CGATCCGTTG GGGGCCATCT CTCGTTTCGA CTATGACGCG
GTTGGCAATC TCACATCAAT TACCGATTCG CTCGGTGGAG TTCAGACGAT TACTTACGAT
GCGGCGGACC GACCGGTAAC GGTCACAAAT CCAGACGGCA GTAGTGTGAC CTATTGCTAC
GATGCTGCCG ATCGGCTTGT GCGCGTCGTT GGGCCGCGAT CCGGTGAGTT CTATACGCTT
ACGTATGATG CCATGGATCG CTTGGTGGCG GTAACCAACG CGCTCGGTGC GACGGAGCAT
TTTGAGTATG ATGCTGTTGG CAATCCGATT GCCCAGATCG ATCCGTTGGG TAATCGTACC
GAATGGCGTT ACGATGAACT CGACCGATTG GTAGCGATAA TCGCGCCACC TCTCGCCGAT
GGCACGCGCC CTACTACCCA GTTTAGCTAT GACGCGGTCG GTAATCTAAC CAGTGCCACA
ACGCCACGTG GCTTCACGAT CCAATACTTC TACGATGAAA ACGGTAATCT AACCAAGGTC
ATTGACCCGT TGGATGCTGT CACACGCTAT CTCTACGATC CTGAAGATCG GCTAATCGCG
GCCACCGATC CTAACGGTCA TTCCATTCGC TATACCTACG ATCCGGTTGG TAATGTGGTA
GCGATGACCA ACGGTGCAGG TGAGAGTCTG CAATTCGTCT ATGATGCAGC GTACAACGTG
GTTGAACAGA TTGATCCGCT AGGGCGGGTT GTTCGATATG ACTACAACGA ACGCAACGAA
CTTGTGCGGG TGACCGATCC GCTGGGTAAT CAAACTTCCT ACTTGCGTGA TGCGTTAGGT
CGGGTGAGTG CCGTCGTGGA CGCCCTTGGC CGTCGTACCG ATTATGTGTA CGATCCGCTC
GGTCGCTTGT TGGCAGTCAT TGATCCACTG CGTAACCGTA CCGGCTATGA ATACAACGAG
GCCGGTGATC TTGTCGCCAT ACGTGACGCC AACGGTAATG TGAGCCGTTT CAACTTTGAT
GTCATTGGCC GACTCACCGG TGAAATTGAC CCCCTTGGTC GGCAGTGGCG CTATGTCTAC
GATGCAGCAA ACCGACCAAC CCAACGAATT GATGCGCTCG GACGGTCTAC CTTCTACGAT
TACGACAGTA ATGGGCGATT AACCGGTATT CGCTACAGCG TACCGCCGAC GGTTCAGTCA
CCGGTTACCA TTACCTATGA CCTTGACGGT AACGAGACAG CACGTTGCAC TGATCTCGGC
TGTGTGAGCC ATGCGTATAA TCCGCTCGGT ATGCCGGTTG AGACGGTGGA TTGGGCTGGA
CGAGTCGTAA AACGCAGTTT CGATGCTGCC GGCAATTTGG TGGAACTGAT CTATCCTGAT
GGTCGCCCGG TACGGTATGA CTACGACGCG GCGAATCGCT TAATCGGCAT TACACTGCCT
GATCAGCAGA AGAGTATTAT CGAACGCAAT AGAGCCGGTG AGGTTGCGCA GATTATCCAT
CCAAACGGGA TTCGCAGTAG CTTTAGCTAC GACGTTGCCG GACGCCTGAC GAAGATCAAC
CATCGTCGTG CAGATGCTGC CGAACCGCAG ACGGAGTTTG CCTACGCCCT TGATCGTGTC
GGTAATCGTG TGGCGGTCAG CGAAACGCGC GCTGCGTTTG ATGGTAGTAA TCGGCGGGTT
AACCTTACCC GAAGCTATAC CTACGACAAT AATAACCGGC TCATTCAGAC GATTAGTAAT
CTCGGTAGTG ATACCCGCTA TACCTTTGAT GCGGTCGGCA ATCGGATCGG TACAGACGGT
TCGCGCCTTA CGTCTGACGC TCGTCTACCC CAGTTACCGG TCAAACCTGA GGCAATTGAT
CAGCGCTACC GTTACGATGC GGCCAATCAA CTGATCGCTG ATGGAGATAC AACCATACAA
TACAACGCCA ACGGCGAGCG ACAACGCGAA GAGCGCCGTC TCCCCGATGG TCGGGTCACA
ACTACTGATT ACCAATTCGA TCTTGAAGGG CGATTGATCG GGGTAACGGT GCAAACCGCC
GGTGTCGTTC AGATGGAAGC CAGCTATACC TACGATGGCT ATGGTCGGCG CGCGACGAAG
ACGGTGCGCT ATCCCCAAAC CGGCTTACCG CCCGAAGTGA CCGAATATAC TTACGATGGC
CTTGACATCA TTGGTAGCGA GGTACGTCAA GGGTCGTTGG TCGCATCAAG TTCGTACTTC
CTGATGGAGT CACCGCTCGT GGCTTTGCGC CGCCCCTTTG CAATGGAGCG GCTTGACACC
GGTTCTACGT ACTGGTTCCA GACCGATGGC CTAGATTCAA TCGTTGGTAT CACGGATGAA
GGCGGGAATC TTGTTGGTCA GATGCTCTAT GACGAGTACG GTCAGCGTCT TGCCGGCGAT
CCAACCCTGC AACGATTCGC CTTCACAGCC CAAGCCTACG ATGCTGAGAC CGGTTTTGTC
CATTTCCATG CCCGTCTCTA CGACCCAGCC CGCGGTGTTT GGCTCAGCCC TGACCCCTAC
CGTGGCAATA TTGCTCTGCC CAATTCGCTG CATCGGTACG GCTATGTGGC CAATAATCCA
ACCGGTTGGA TTGATGCGTG GGGCTATGAC CGGCAAACCT CCGGCGGAAG TGGGGTCAGG
TTAGCGGTTG GCTCGAATTC TCGTGTGTCA GTTGACGTGT CTTATAGTGG TGGGAATCGG
TCGCCGGTGA ACCCGGGAAC CGGTAGTGCG GCAAAGAAAC TGAGTAAGGG CAAGGCTATG
GTTTTTTTTA AAAAGAATTC TGCTCCTATT CGTTTGAAAG ATATAACTGG TCGTCTTTAT
CCTGCCATAC GGAGCATGTT TAGTGGAACG TTTTTTGGAG GTCTATCACC TTTTTATGCT
GGACTATCAT ATGGAGCACG TTTAATAGAA GATTCTTTTG GAGATTCATT ACTTTTTTAT
GCTGGACATG TTGCTTGGGC TTTTCAAAAA GACGAGAATT GCTATGTTTA TGGTAGTACA
GATGGTCTTA GAAAGGAAGA TAGGGGGAAA TATTTTTGGG TAAAAGGTCT TGATGATTGT
ATAAAAGAAG ATGAAGTTAT CAATGATATG AAAGCACGTA ACCCATCTAG AAAAGTAGAA
GAATATGACG CATACAAAAT AATAGAAGTA GACGATCCAA ACACTAATAT TGAGTGGAAA
ATATCCGAGG TGGAAAAACG GCAATACGGC AATTTTTTTG GTAACTGTAT GGATGATACA
TACGATATAT TAACAGCGTA TGGTGCCAAA CTTCCACTGC CAATAACAAA ACCTCTTCCA
ACATGGTTCT TCGAAAGCAT AGCTGGTGAA GCTCAGGAGC TGAACCCGAA CCCGAAAATG
AACACCACAG ACCCAAAGGG GAAAAAATAG
 
Protein sequence
MHRVWRWINL LTVFVVLSGL LPTQMPVVAN ESLSAASFRR VSPPQQAPSN PPQPAVSSYR 
IFVPLLQSPG LLLDTAEQVL NTDQPTTLTL LNNQIQVNFA PTTDERTIQA TLTAYKGTPV
TTPGRGQAGP ALRLVLADRL HPGNEVRLPP TITPLPRQPF YPADTQVTPG IVLEWVYSDA
DIWGVDERSL GLYRRQAATD SWQRVPSAVI PDQNRLIAHL ETGGEYALLG ELQVVQLNQR
TMRVALDPDD NDGFALWPQI GRVEEMTYNW RLVTAVEQRF RSSGCPVNIL ITRDATPFVN
ESLRAAAING FGADIAVTLA FNSFIGTPWG GLGDGGPIAF ARINAPADRS LAQRLLDSMR
DYTGRRSTRP VLSPLPHPEF NSLTMPYAHL EVLFLDHIFD WPVINTAFDQ IVNAVYAALV
SELTQYGLLC TPPGGSNPTP PSLSARPSAE LLLRLRNLGY QNFQRYGMDP VSFSTGNHIL
VQPLVRIPGR GGLDIDLTLV YNSQDPRCDI LGCGWSFPYN IRLQRYSDES VAVVYPDGRT
MLYEWTGSEY RPPAGGYDRL ERREDFWFLT SRDGEQTWQF QETVTGLGIL VAWRDRRGNA
LTFTHDLSGQ DAWRRGEAVP RPPLTAITDA TGRVIRVEND AAGRIRAFVL PDGRRFDLEF
DDRGDLVAIT DANTPTRGTY RFEYDERHRI VKQWDPEGIL YLVNEYDDRD RVVRQVDASG
SVSLASYDPI ARTTVFTDNL GFRYVYAYDE LYRVIAETDP LNRTSRTVYD DQYNVVAYTD
ANGRTVRASY DHRGRLTALH EPVSEGSLAC REHSYSVDTK VWEYNGSGAN ADLPTAFIDE
LGRRWEYRYD DEGNLVQIIA PIGAMSFRFD EWGQRVASID AAGRVTQYFY DAHGNLARII
DPKNGTTSFT HDITGRLLSI TDANNRTVSF AYYGNDLIAK VTDAKGNELS FGYNPNGLLT
SLTDRNGVTR RFSYDVNLNL IGELDHPNGA WLTYAYDRLQ RLIRTTDRNG HSTEYRYDPA
GQLREVIDPL GAISRFDYDA VGNLTSITDS LGGVQTITYD AADRPVTVTN PDGSSVTYCY
DAADRLVRVV GPRSGEFYTL TYDAMDRLVA VTNALGATEH FEYDAVGNPI AQIDPLGNRT
EWRYDELDRL VAIIAPPLAD GTRPTTQFSY DAVGNLTSAT TPRGFTIQYF YDENGNLTKV
IDPLDAVTRY LYDPEDRLIA ATDPNGHSIR YTYDPVGNVV AMTNGAGESL QFVYDAAYNV
VEQIDPLGRV VRYDYNERNE LVRVTDPLGN QTSYLRDALG RVSAVVDALG RRTDYVYDPL
GRLLAVIDPL RNRTGYEYNE AGDLVAIRDA NGNVSRFNFD VIGRLTGEID PLGRQWRYVY
DAANRPTQRI DALGRSTFYD YDSNGRLTGI RYSVPPTVQS PVTITYDLDG NETARCTDLG
CVSHAYNPLG MPVETVDWAG RVVKRSFDAA GNLVELIYPD GRPVRYDYDA ANRLIGITLP
DQQKSIIERN RAGEVAQIIH PNGIRSSFSY DVAGRLTKIN HRRADAAEPQ TEFAYALDRV
GNRVAVSETR AAFDGSNRRV NLTRSYTYDN NNRLIQTISN LGSDTRYTFD AVGNRIGTDG
SRLTSDARLP QLPVKPEAID QRYRYDAANQ LIADGDTTIQ YNANGERQRE ERRLPDGRVT
TTDYQFDLEG RLIGVTVQTA GVVQMEASYT YDGYGRRATK TVRYPQTGLP PEVTEYTYDG
LDIIGSEVRQ GSLVASSSYF LMESPLVALR RPFAMERLDT GSTYWFQTDG LDSIVGITDE
GGNLVGQMLY DEYGQRLAGD PTLQRFAFTA QAYDAETGFV HFHARLYDPA RGVWLSPDPY
RGNIALPNSL HRYGYVANNP TGWIDAWGYD RQTSGGSGVR LAVGSNSRVS VDVSYSGGNR
SPVNPGTGSA AKKLSKGKAM VFFKKNSAPI RLKDITGRLY PAIRSMFSGT FFGGLSPFYA
GLSYGARLIE DSFGDSLLFY AGHVAWAFQK DENCYVYGST DGLRKEDRGK YFWVKGLDDC
IKEDEVINDM KARNPSRKVE EYDAYKIIEV DDPNTNIEWK ISEVEKRQYG NFFGNCMDDT
YDILTAYGAK LPLPITKPLP TWFFESIAGE AQELNPNPKM NTTDPKGKK