Gene Information |
Locus tag | Haur_5146 |
Symbol | |
ID | 5737104 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 203332 |
End bp | 210456 |
Gene Length | 7125 bp |
Protein Length | 2374 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641282311 |
Product | YD repeat-containing protein |
Protein accession | YP_001547902 |
Protein GI | 159901656 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
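The coordinate and length fields above are consistent with one another, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes one stop codon. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Haur_5146.
length_bp = gene_length(203332, 210456)
print(length_bp)                  # 7125
print(protein_length(length_bp))  # 2374
```

Both results match the Gene Length and Protein Length rows, which fits a gene that is reported as neither spliced nor a pseudogene.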
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTTG TGCCACGGTG GCTTGCCCAT CTGTTGCTCA TCACGCTGTT GGGTTCGGGT GTTCCACGCG CATCTATGCG TCCAACCGTG CTCGCCAATC CGCAGCCGCA GCCGGCCACG GCCCCGGTCA CCCCGAACAA CGAGCTGCCG CCCTTGCCTG ATTTGCCGCC CGTCGCCCTC CCGCTGGTGG ATACACCCAC CCTACGTCCG ACCATTCCGG TGACCACGAC CGCACCATCA CCATTTGGCC CGTTATTGAC CCGCATCAGT GATCCCGCCA TTGCCCCATC GCCAACGCGT CCGCGTCCGT TGGAGGCTGA TCAGACCCGT GCACCATCAA CCCCAGCGGT TACGCCGCAG GATGTGAGTT CCTGTACTGG CGATGCGACC AATCCCCGCG CGTGGTGGCA ACGGGATAGC ACCAACCAGT ATGAATTTCT GAATCCCAGT AACGTGGATT GGTCGCATGA TCCGACGGGT ACCACCCTGC AACGGCGCTT TTTCCAGACC GATCGCTCGG TCGGCATTGT CATTTGTCTC CGCCCCACCG AGCAAATCGC CAGCTGGGTG CCCAATGCAA CCACCTGGAC GACCACGTGG TATCGGAGTG CCACTCCGAC CAGCACTGTG CCAACCATCG AGTTTGGCCG CGTCACGATG ACCTATCGCG GCGTTCAGAA TGGGACGCTC CTCTTTACCG CTCGCTGGAA TACGGGGAAC GAGGTTCCCT TCGATCAGAA CTGTACGGCG ACGGCCAACC CTTGTGCAAC CCTCGCGCCC CTCAATATTG ATGCGCTCGA TCTCAAACGC TATCCTGGCA TGTGGCGAGC CGTGGTGACG AGTACCACCG GCATGACCGC CACCCCACCC GATAGCGTGA TGATTAACTT AACCGTCACC GACTATGAGG CCAGCCCCAA TCCGATCAAT CGGGCAACCC CGACCACCTT GACGACCCGG TTCCAAGCAA CCCCACCCAC GCGGAATGCA CAGGGTCAGC CGTCCATGCC CTTCTTGGGC TATGACTATG ATCTCTATAC CCCGGCGGGT CAGCATATCA TGACCTTGCC CCATCCCACC CCGGGCTTTG ACCCGAGTAC CGCGACCGCC AGTTTCGCCA CGAGTTGGAA TAACCAACTC ACCAATCCCA CCCGCCCGAT TACCGATGGC ACCTATCTCG TCTCCTTCAT GTTTGCCTTT GGGGAACCGT TGTGGCTAGG CTGGAGCGAA CCCGCCTACA TTCAAGCGAT CTGGCCACTC TATGTTGGTG TGCCCGCCAC CCAATTGCGC AACAAAGGTT CCTGTGAATT TGGCTGTTCC CCTGCGTCCT ACCAAGTCGT GGCGGGTGAT CCGGTCAATA CCGCCACGGG CAATTTTCTG CACAGCGAAA CCGACCTCTC CATCCCCACC GAGGGGGAGC CGCTCACCTG GACGCGGTCG TATAACAGTC AAGCCCTGAC CGAGACCTCG GTGCTTGGCC CCGGCTGGAC GCATCCCTTC CACCTTCGGC TGGTCTTGCC AACCAGTCCG TTGGGGGTCT CCAATCAAAT CAGTCTGATC AGTCCGTCGG GGAATCACTA TACTTTTCGG CAATCCGGAT CGACCTATGT CCCCCTCCCC GGAGTCTATG CCACCCTCAC GGCTGATGCG GGTGGCTATA CCGTGAGCCT TCCCGACCTA ACCCGCTATC GCTTCGATCT GCTGGGTCGC GTGACCACGG TTACGACCAG TCGTGGGGCG AGTTTGCAGT ACCAGTATGT CACCAGTGGC CCCTATACCG GCTTGCTTGA CCGGATCACG 
GATACCAGTG GCCAACGCTT CTTGGCGTTG ACCTATACCG GTGAGAGCGC CGCCACCGCC CGCCTTGCCA GTGTCCGCGA CCCGCTCAAT CGAACGGTGA CCTATGGCTA TCAGGATGGC ATGTTGACCA CCGTGACCGA TGTCATGGGG CGCATCACCA CCTATGGCCT GACCGATGGC CTGATTACCA CGATTGATCA CGCGGGCGTG CGCACCCTCA CCAATGTCTA TGAAACGACC ACCGCCAGCG TTCCCGTTGG GAATCCCTCC GCCTGGTTGT ATGCCGCTGC CCCAGCCTTA ACCGCCTGGC CCCCACCGAG TCAACGGCGG GTCATCGCGC AACGCAGTGG CGATGGGGTT GAAACCACCT ATACCTATGC GGCGGATGGC ACGACCATGG CCACCCGTGA CGGCAATCGG GTCTATCAGG AAAAACACTG GTATCGCCCG GATGGCTCCT TGGATCATAT CAGCCATGAA ACCTCTCTCG GCACCATCAC CACCGAACAG GCAACCTTCG ATCCGGTGAC GCTCAGTCGC ACGGCGACGA CGGATCGCTA TGGCAATACC CTGAGCCAAA CGACCAATGC CCGTGGCCAA GCGACCTCTA TGACCGATGC CACAGGGGGC GGCGTGACCA CCACCTATCG AGCAGCAACC GAGGCCATTG CCCCCCAGCA GCCAGCGCTC GTTCGCGATT CCCTCGGACG GGAAACCCGC TATCAGTATG GGCCGGGCGA TGTGCTGACC AGTGTCACCA CCGGGATTGC GCCGACAACC CCGCAGGGCC GCACCACCAC CTATACCTAT GACCAACGCT ACCCGGGCAA GCATTGGCTC GAAGCCGTCA CCGATCCGCT GGGGCACACC ACGACCTATA CCTATAATGC TTGGGGGCAA ATCACCGCCG TCACCGATGC CGCTGGCGCG ACCACCACGA CCACCTACGA TGCGGTGGGC CGCGTCAGTG CCACGACGGA TGCACGGGGA ACGGTGACCC GCTATACCTA TCATCCCGAT GATACCGTGG CCATGGTCGT GCGTAACTGT ACCGACGCAG CGGGGGTTCC CAGCCCTACC GGGCCGTGTG CCGCCCAAAC CGCTGAGCGC AATCTGACCA CGACCTATGG TTATGACCCC CGCGGTCGGC AGGTCTGGGA ACGCTCCCCG CTCGGCACCT ATACCGCCCA ACGCTATACC GCTGCGGGCC ATGTCGCATG GCGCACCAAG GGCTTGGTCG CCGCAGGCTT TCCCGCCACC CTCCCGACCA CCCCACCCGC CTATGATGCC CGCTGGCCCG ATCAAAACAG TACCACCTTC TATGCCTATG ATAGTTTTGG CCGCACGACC TTCATTACTG AAACGGGCAT CGTGACGGGG TCGCTCATGC TGGATTCCAG CCTCGGCTGG CGCTGGACGG CGACAACCCA GCGCGTGACC CGCTTGACCT ATGATGCCTT TGGTCGGCCA TGGCAGGTGA TGCGGCAGTA TGACCCGCAA CAGCCGCTCA CGGGGCGGAG CGATGTCAAT CTGACCACCA CCTATACCTA CGATGATACG GCGACACCCC GTGCCACAAT GATCTGTGAC CCGCTGAATC GCTGTACCCG CACGGCGTAT GATGCCCTTG GCCGCGTCGT CTCGGTCATC GAGCGCTATC AGGATGGCAT TCCTGGCCCG GATGCCGCCG ATCGCCAAAC CGTCACCACC TATACTCCAG CGGGACTGGT CGATACGGTG ATTACGGACT ACCACGATGG TGTCTATGCG CCAGCCACCG ATCCGCTGGC CGACCTGAAA ACGGCCTATA 
CCTATGATAG CTGGGGTCGC GCGACCCGTG TAACCACGGC CGTTGATCCG GCCCCGATCA GTGGGGCAAC CGATGTCAAT CGGACGACCC AGCGCTGGTA TGATGCCAAT GACCGCCTGT ATGCCGAACA AGATGCCATG GGTCGCATCA CCCGCTATCG CTCCAATGCG GTGGGGCAGG TCATCGAAAC GGTCACCAAC TGCACGGGCG CGACAGCAAC GCTGCTCACC GGCACCTGTG ATGCCTTTAC CCCCAGCCAG CCGGATCGCA ATCTGCGGGT GTGGATGCAG TATACGGCGC TCGGCCAAAC CGAGGCGGTC ACGACCATGC TGGATCTGAC GGCACGGCGC ACGACAGTGT ACCGCTATGA TCCCCTGGGG CGGATGACTG TCCAGGTCGA CCAGTGTACG GCCAATGGGC AACCCGTCAT TGCGGGCTGT GATACCCCCA ATCCCACGGA TACCACCCAG AGTGATACCA ACCGTCGGCA GTTGTGGATC TATGATGCGG AAGGGAATAT CGTCGACCAA CGGAATCCAC GTGGGTTTCG CGAGACCTTT ACCTATGATC GCCTGAACCA GCAACGCAGT CGAACATACT ACTTCATCGG CATCACCTGC GAGCCGACGA CCATCACCTG CGCGACCACC CGCCAACAGG CCGATGGAAC GGGCCAAGTA CGCTGGCAGG TTGATGAAAC CGGGATGGAT GCCACTGATC TGCGCCGGAA TCTCGTAGCC ACGCATGTTG ATCCCGTGGG ACGGGTCGTG CGGACGGTTG CTGCTGCCGA CAATACGGGC CAATTCGGGA TCGTCACGAC CACCCAGTAT GATCGTGGGA GCCGCGCCAT TGCCACCACC AATGCCGATG GGCAAACGAC GCTGCTGGGC TATCGGGGCG ATGATCGGGT CACCACCGTC GTCGAGAATG CAACCGGACC CGTCGCGCCC ATCGCCGTCA CGACAACGGC CCAATACGAT GTTCATGGCA ATCTGCGCAG CCTCACCGAT GCCGAGCAGC ATACCCGCCG CTGGCAGTAT GACAGTCGGG GCTTGGTGCT GAGCCAACAG GATGCCGCCA ATCGCACGAC CACCATGACC TACGATACGG GTGGGCGGAT GCGCACGAAA ACTGACCCGC GCGGCAGTGC GATGGCCGTC AGCTACACCT ATGACCTGCG TGATCGGGTG ACCCAGATCA GCTCGCCTGG CCTCGCAACC ACGATTACCA TGCAGTATGA TGCCAGCGGT CGGCGCACCC AGATGACCGA TGGGAGTGGG ACAACAAGCT GGACGTATGA CCGCGCGGAT CGCGTCGCGA CGACCACCCA ACCCGTCGTT GGCGGTCTCC AGTATGGCTA TGATGCGACC GACCAACGGG TTGCACTGAC CTATCCGAGT GGTGGCCCGC AGCTGGGCTA TGGCTATGAT ATGGATGGGT TGATCAACGC GATTGATACC AATCGTGATG GCACACGCGA TATTAACTAT ACGCACACCT TCGATGGTCA GCTGACCAGC ATTGAACCGA TTGGCCAGCC ATTTACCCTC CAATGGACCT ATGATGCGCT TGATCGGGTC ACCAGCATCG ACCAAGGTCG AGCTGCCGCG CAACGCCAAC CCGATCACCG CACGCGGGGA CAGCGTGCTC AGCCGCAGGA TCTCGTCCCC ACGACCACGT TTGGCATGCG CTATACCCTC AGTGCCGCTG GTCGCCACGA GGCCATCACC GAATGGCAAC AGGGGCCGGG CACCGCACTC CCGCTCCCCG TAGGAGCCTA CCGCCAATAT CTCCCGCTGG TCCTGGATGA 
GCGGCTGGAT GGGTGGAGTT TGCAGTATGA CGGCCTCAAT CGTCTGACGT TTGCGACGCT TAGCCGACCA GGCGATGGAA CGCCCGCGCG GGGATTGGTT GCCGATGAAA CCGAAGGGGC AACCTATGAT CGGGTCGGCA ATCCACTCAC GCGCCAGCGC AATGGGAGCA CGAGTGGGCC ATGGAGCTAT ACGGCTGCCG ATCAACGCAC CGATTGGACG TATGATGCGG CGGGGAATGT GCTCAACGAT GGCACCGCGA CGTATACCTA TGATGCCTTG GGTCGCTTAA CCAGCCGCAC GCAAGGCGGC ACGACCAGCA CCCATACCTA CAATGGCGAT GGCCTGCTGG TCGCCAGCAC GACGAATGGC GCAACGACCC GATTCCTGTG GGATACCACG ACCCCGAATG CCCAATTGGT CGGCACCCAG CAAGCGAGCG GCACCACCTG GTATGTCTGG TCGCCGATGG GCGGCGGGAC GGCGCGGGTG CTCTATAGCC TTGGGCCAAG TGGCCGTCGG TGGCTGATGA GTGACGGGAT TGGATCGGTC CGCCGCACGC TGAGCGATAG CGGAACCGTG ATCCAAACTC GGCAGTGGAC CCCGTTTGGG GTGGAAATCG GCGGCACCGC CAGTGCCGGA CTGGGCTATG CGGGAGAATG GCAAGAAGCC AGCGGCTTGG TGTATCTGCG CGCACGGTGG TATGATCCGG CAGCGAGTCG CTTCCTGAGC CGTGACCCGT GGGATGGGAT GATCAGCAAT CCCCAGACCC TCAATCCCTA TGCCTATGCC CATAACCAAC CCACCCGCTT TACCGATCCC AGCGGGAAAA CGCCCCTCCT TGCGGCGTTG GCAGGAGTTG AAATTGGATT ACCAATCCTA TCGGGATTGG GGGCTGCGGC GGCAGCCTGT TATCTCAGCA TTGTCTGTGG TGTTGCCGTC TTAACCGCCG TGGTAGTTGG TGCGTTTGGC CTCGCCTATC TCTGTCAGCA GGGAGTTCTT TGCCAACCTG ATGCAGCGAC ACAACCATCC GATTATATTG ATACGGATGA AGAACTCTTT AACAACGTTG ATTGTGACTA TATTGAACTA CCAGACGGGA ATAACGAAGG TCCCGACGGA TTACCACCAC TCAGGCCTGG CTTAAAATTT GCCGCAATAA TCTTATTGAC TACGGTAGGG GTAGTCAATC TGAGCCAGAT GATTCCCCGA TCTGCACCAC AATCTACACC TGAATCAGCT CCTACGGCAC CTATTTATAC TGACACAAAT CCTAAAGATA ATGATAGAGC CAAACAAGTC TATTACAGAG GCTTAAGTAA CAGGGATTTA AGAGAATTTA ATACCTATGG TATGATTCGG AGTAAATTCA TTAGAGACGG GGGGAAAGTA GAAGATGCGG CTGATATGAT CAATAATCCC CAAGCAAGGG AAAATCATCC TTATGGATCG GGAGGGTCAC CATTTGTTTC AGTAACAAAG AATCCTACAA TGGCGATGAA GTTTGCAACA AAATACTTAG ATGATCCAAC AGATAATGGA TCTAGCGGCT ACATTCTTAG AATTACGACA ACACGAAAAT ACTATACCTC TCCAACAAGT TTTGGCCATG AACAAGAAGC TCTCGCACCT ATTCTGATAG GCGGAATATT TGATACCTTA GAGATCAATC CACAGCAAGG TGTAATTCCA CCTGGATGGA ATTGA
|
Protein sequence | MSLVPRWLAH LLLITLLGSG VPRASMRPTV LANPQPQPAT APVTPNNELP PLPDLPPVAL PLVDTPTLRP TIPVTTTAPS PFGPLLTRIS DPAIAPSPTR PRPLEADQTR APSTPAVTPQ DVSSCTGDAT NPRAWWQRDS TNQYEFLNPS NVDWSHDPTG TTLQRRFFQT DRSVGIVICL RPTEQIASWV PNATTWTTTW YRSATPTSTV PTIEFGRVTM TYRGVQNGTL LFTARWNTGN EVPFDQNCTA TANPCATLAP LNIDALDLKR YPGMWRAVVT STTGMTATPP DSVMINLTVT DYEASPNPIN RATPTTLTTR FQATPPTRNA QGQPSMPFLG YDYDLYTPAG QHIMTLPHPT PGFDPSTATA SFATSWNNQL TNPTRPITDG TYLVSFMFAF GEPLWLGWSE PAYIQAIWPL YVGVPATQLR NKGSCEFGCS PASYQVVAGD PVNTATGNFL HSETDLSIPT EGEPLTWTRS YNSQALTETS VLGPGWTHPF HLRLVLPTSP LGVSNQISLI SPSGNHYTFR QSGSTYVPLP GVYATLTADA GGYTVSLPDL TRYRFDLLGR VTTVTTSRGA SLQYQYVTSG PYTGLLDRIT DTSGQRFLAL TYTGESAATA RLASVRDPLN RTVTYGYQDG MLTTVTDVMG RITTYGLTDG LITTIDHAGV RTLTNVYETT TASVPVGNPS AWLYAAAPAL TAWPPPSQRR VIAQRSGDGV ETTYTYAADG TTMATRDGNR VYQEKHWYRP DGSLDHISHE TSLGTITTEQ ATFDPVTLSR TATTDRYGNT LSQTTNARGQ ATSMTDATGG GVTTTYRAAT EAIAPQQPAL VRDSLGRETR YQYGPGDVLT SVTTGIAPTT PQGRTTTYTY DQRYPGKHWL EAVTDPLGHT TTYTYNAWGQ ITAVTDAAGA TTTTTYDAVG RVSATTDARG TVTRYTYHPD DTVAMVVRNC TDAAGVPSPT GPCAAQTAER NLTTTYGYDP RGRQVWERSP LGTYTAQRYT AAGHVAWRTK GLVAAGFPAT LPTTPPAYDA RWPDQNSTTF YAYDSFGRTT FITETGIVTG SLMLDSSLGW RWTATTQRVT RLTYDAFGRP WQVMRQYDPQ QPLTGRSDVN LTTTYTYDDT ATPRATMICD PLNRCTRTAY DALGRVVSVI ERYQDGIPGP DAADRQTVTT YTPAGLVDTV ITDYHDGVYA PATDPLADLK TAYTYDSWGR ATRVTTAVDP APISGATDVN RTTQRWYDAN DRLYAEQDAM GRITRYRSNA VGQVIETVTN CTGATATLLT GTCDAFTPSQ PDRNLRVWMQ YTALGQTEAV TTMLDLTARR TTVYRYDPLG RMTVQVDQCT ANGQPVIAGC DTPNPTDTTQ SDTNRRQLWI YDAEGNIVDQ RNPRGFRETF TYDRLNQQRS RTYYFIGITC EPTTITCATT RQQADGTGQV RWQVDETGMD ATDLRRNLVA THVDPVGRVV RTVAAADNTG QFGIVTTTQY DRGSRAIATT NADGQTTLLG YRGDDRVTTV VENATGPVAP IAVTTTAQYD VHGNLRSLTD AEQHTRRWQY DSRGLVLSQQ DAANRTTTMT YDTGGRMRTK TDPRGSAMAV SYTYDLRDRV TQISSPGLAT TITMQYDASG RRTQMTDGSG TTSWTYDRAD RVATTTQPVV GGLQYGYDAT DQRVALTYPS GGPQLGYGYD MDGLINAIDT NRDGTRDINY THTFDGQLTS IEPIGQPFTL QWTYDALDRV TSIDQGRAAA QRQPDHRTRG QRAQPQDLVP TTTFGMRYTL SAAGRHEAIT EWQQGPGTAL PLPVGAYRQY 
LPLVLDERLD GWSLQYDGLN RLTFATLSRP GDGTPARGLV ADETEGATYD RVGNPLTRQR NGSTSGPWSY TAADQRTDWT YDAAGNVLND GTATYTYDAL GRLTSRTQGG TTSTHTYNGD GLLVASTTNG ATTRFLWDTT TPNAQLVGTQ QASGTTWYVW SPMGGGTARV LYSLGPSGRR WLMSDGIGSV RRTLSDSGTV IQTRQWTPFG VEIGGTASAG LGYAGEWQEA SGLVYLRARW YDPAASRFLS RDPWDGMISN PQTLNPYAYA HNQPTRFTDP SGKTPLLAAL AGVEIGLPIL SGLGAAAAAC YLSIVCGVAV LTAVVVGAFG LAYLCQQGVL CQPDAATQPS DYIDTDEELF NNVDCDYIEL PDGNNEGPDG LPPLRPGLKF AAIILLTTVG VVNLSQMIPR SAPQSTPESA PTAPIYTDTN PKDNDRAKQV YYRGLSNRDL REFNTYGMIR SKFIRDGGKV EDAADMINNP QARENHPYGS GGSPFVSVTK NPTMAMKFAT KYLDDPTDNG SSGYILRITT TRKYYTSPTS FGHEQEALAP ILIGGIFDTL EINPQQGVIP PGWN
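The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes GC content, and translates the first 30 bases of the gene sequence. The helper names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in which codons may act as start codons; that does not matter for this ATG-initiated excerpt).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks from a sequence field."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence in this record.
excerpt = clean("ATGTCGCTTG TGCCACGGTG GCTTGCCCAT")
print(translate(excerpt))  # MSLVPRWLAH
```

The translated excerpt matches the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_percent` should allow the GC content row to be checked in the same way.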