Gene Haur_5146 details


Gene Information

Locus tag: Haur_5146
Symbol:
ID: 5737104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 203332
End bp: 210456
Gene Length: 7125 bp
Protein Length: 2374 aa
Translation table: 11
GC content: 59%
IMG OID: 641282311
Product: YD repeat-containing protein
Protein accession: YP_001547902
Protein GI: 159901656
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
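
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 203332
end_bp = 210456

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length_bp % 3 == 0

# Assumption: the final codon is a stop codon and is not counted in the protein.
predicted_protein_length = gene_length_bp // 3 - 1

print(gene_length_bp, predicted_protein_length)  # 7125 2374
```

Both results agree with the listed gene length (7125 bp) and protein length (2374 aa).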


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGCTTG TGCCACGGTG GCTTGCCCAT CTGTTGCTCA TCACGCTGTT GGGTTCGGGT 
GTTCCACGCG CATCTATGCG TCCAACCGTG CTCGCCAATC CGCAGCCGCA GCCGGCCACG
GCCCCGGTCA CCCCGAACAA CGAGCTGCCG CCCTTGCCTG ATTTGCCGCC CGTCGCCCTC
CCGCTGGTGG ATACACCCAC CCTACGTCCG ACCATTCCGG TGACCACGAC CGCACCATCA
CCATTTGGCC CGTTATTGAC CCGCATCAGT GATCCCGCCA TTGCCCCATC GCCAACGCGT
CCGCGTCCGT TGGAGGCTGA TCAGACCCGT GCACCATCAA CCCCAGCGGT TACGCCGCAG
GATGTGAGTT CCTGTACTGG CGATGCGACC AATCCCCGCG CGTGGTGGCA ACGGGATAGC
ACCAACCAGT ATGAATTTCT GAATCCCAGT AACGTGGATT GGTCGCATGA TCCGACGGGT
ACCACCCTGC AACGGCGCTT TTTCCAGACC GATCGCTCGG TCGGCATTGT CATTTGTCTC
CGCCCCACCG AGCAAATCGC CAGCTGGGTG CCCAATGCAA CCACCTGGAC GACCACGTGG
TATCGGAGTG CCACTCCGAC CAGCACTGTG CCAACCATCG AGTTTGGCCG CGTCACGATG
ACCTATCGCG GCGTTCAGAA TGGGACGCTC CTCTTTACCG CTCGCTGGAA TACGGGGAAC
GAGGTTCCCT TCGATCAGAA CTGTACGGCG ACGGCCAACC CTTGTGCAAC CCTCGCGCCC
CTCAATATTG ATGCGCTCGA TCTCAAACGC TATCCTGGCA TGTGGCGAGC CGTGGTGACG
AGTACCACCG GCATGACCGC CACCCCACCC GATAGCGTGA TGATTAACTT AACCGTCACC
GACTATGAGG CCAGCCCCAA TCCGATCAAT CGGGCAACCC CGACCACCTT GACGACCCGG
TTCCAAGCAA CCCCACCCAC GCGGAATGCA CAGGGTCAGC CGTCCATGCC CTTCTTGGGC
TATGACTATG ATCTCTATAC CCCGGCGGGT CAGCATATCA TGACCTTGCC CCATCCCACC
CCGGGCTTTG ACCCGAGTAC CGCGACCGCC AGTTTCGCCA CGAGTTGGAA TAACCAACTC
ACCAATCCCA CCCGCCCGAT TACCGATGGC ACCTATCTCG TCTCCTTCAT GTTTGCCTTT
GGGGAACCGT TGTGGCTAGG CTGGAGCGAA CCCGCCTACA TTCAAGCGAT CTGGCCACTC
TATGTTGGTG TGCCCGCCAC CCAATTGCGC AACAAAGGTT CCTGTGAATT TGGCTGTTCC
CCTGCGTCCT ACCAAGTCGT GGCGGGTGAT CCGGTCAATA CCGCCACGGG CAATTTTCTG
CACAGCGAAA CCGACCTCTC CATCCCCACC GAGGGGGAGC CGCTCACCTG GACGCGGTCG
TATAACAGTC AAGCCCTGAC CGAGACCTCG GTGCTTGGCC CCGGCTGGAC GCATCCCTTC
CACCTTCGGC TGGTCTTGCC AACCAGTCCG TTGGGGGTCT CCAATCAAAT CAGTCTGATC
AGTCCGTCGG GGAATCACTA TACTTTTCGG CAATCCGGAT CGACCTATGT CCCCCTCCCC
GGAGTCTATG CCACCCTCAC GGCTGATGCG GGTGGCTATA CCGTGAGCCT TCCCGACCTA
ACCCGCTATC GCTTCGATCT GCTGGGTCGC GTGACCACGG TTACGACCAG TCGTGGGGCG
AGTTTGCAGT ACCAGTATGT CACCAGTGGC CCCTATACCG GCTTGCTTGA CCGGATCACG
GATACCAGTG GCCAACGCTT CTTGGCGTTG ACCTATACCG GTGAGAGCGC CGCCACCGCC
CGCCTTGCCA GTGTCCGCGA CCCGCTCAAT CGAACGGTGA CCTATGGCTA TCAGGATGGC
ATGTTGACCA CCGTGACCGA TGTCATGGGG CGCATCACCA CCTATGGCCT GACCGATGGC
CTGATTACCA CGATTGATCA CGCGGGCGTG CGCACCCTCA CCAATGTCTA TGAAACGACC
ACCGCCAGCG TTCCCGTTGG GAATCCCTCC GCCTGGTTGT ATGCCGCTGC CCCAGCCTTA
ACCGCCTGGC CCCCACCGAG TCAACGGCGG GTCATCGCGC AACGCAGTGG CGATGGGGTT
GAAACCACCT ATACCTATGC GGCGGATGGC ACGACCATGG CCACCCGTGA CGGCAATCGG
GTCTATCAGG AAAAACACTG GTATCGCCCG GATGGCTCCT TGGATCATAT CAGCCATGAA
ACCTCTCTCG GCACCATCAC CACCGAACAG GCAACCTTCG ATCCGGTGAC GCTCAGTCGC
ACGGCGACGA CGGATCGCTA TGGCAATACC CTGAGCCAAA CGACCAATGC CCGTGGCCAA
GCGACCTCTA TGACCGATGC CACAGGGGGC GGCGTGACCA CCACCTATCG AGCAGCAACC
GAGGCCATTG CCCCCCAGCA GCCAGCGCTC GTTCGCGATT CCCTCGGACG GGAAACCCGC
TATCAGTATG GGCCGGGCGA TGTGCTGACC AGTGTCACCA CCGGGATTGC GCCGACAACC
CCGCAGGGCC GCACCACCAC CTATACCTAT GACCAACGCT ACCCGGGCAA GCATTGGCTC
GAAGCCGTCA CCGATCCGCT GGGGCACACC ACGACCTATA CCTATAATGC TTGGGGGCAA
ATCACCGCCG TCACCGATGC CGCTGGCGCG ACCACCACGA CCACCTACGA TGCGGTGGGC
CGCGTCAGTG CCACGACGGA TGCACGGGGA ACGGTGACCC GCTATACCTA TCATCCCGAT
GATACCGTGG CCATGGTCGT GCGTAACTGT ACCGACGCAG CGGGGGTTCC CAGCCCTACC
GGGCCGTGTG CCGCCCAAAC CGCTGAGCGC AATCTGACCA CGACCTATGG TTATGACCCC
CGCGGTCGGC AGGTCTGGGA ACGCTCCCCG CTCGGCACCT ATACCGCCCA ACGCTATACC
GCTGCGGGCC ATGTCGCATG GCGCACCAAG GGCTTGGTCG CCGCAGGCTT TCCCGCCACC
CTCCCGACCA CCCCACCCGC CTATGATGCC CGCTGGCCCG ATCAAAACAG TACCACCTTC
TATGCCTATG ATAGTTTTGG CCGCACGACC TTCATTACTG AAACGGGCAT CGTGACGGGG
TCGCTCATGC TGGATTCCAG CCTCGGCTGG CGCTGGACGG CGACAACCCA GCGCGTGACC
CGCTTGACCT ATGATGCCTT TGGTCGGCCA TGGCAGGTGA TGCGGCAGTA TGACCCGCAA
CAGCCGCTCA CGGGGCGGAG CGATGTCAAT CTGACCACCA CCTATACCTA CGATGATACG
GCGACACCCC GTGCCACAAT GATCTGTGAC CCGCTGAATC GCTGTACCCG CACGGCGTAT
GATGCCCTTG GCCGCGTCGT CTCGGTCATC GAGCGCTATC AGGATGGCAT TCCTGGCCCG
GATGCCGCCG ATCGCCAAAC CGTCACCACC TATACTCCAG CGGGACTGGT CGATACGGTG
ATTACGGACT ACCACGATGG TGTCTATGCG CCAGCCACCG ATCCGCTGGC CGACCTGAAA
ACGGCCTATA CCTATGATAG CTGGGGTCGC GCGACCCGTG TAACCACGGC CGTTGATCCG
GCCCCGATCA GTGGGGCAAC CGATGTCAAT CGGACGACCC AGCGCTGGTA TGATGCCAAT
GACCGCCTGT ATGCCGAACA AGATGCCATG GGTCGCATCA CCCGCTATCG CTCCAATGCG
GTGGGGCAGG TCATCGAAAC GGTCACCAAC TGCACGGGCG CGACAGCAAC GCTGCTCACC
GGCACCTGTG ATGCCTTTAC CCCCAGCCAG CCGGATCGCA ATCTGCGGGT GTGGATGCAG
TATACGGCGC TCGGCCAAAC CGAGGCGGTC ACGACCATGC TGGATCTGAC GGCACGGCGC
ACGACAGTGT ACCGCTATGA TCCCCTGGGG CGGATGACTG TCCAGGTCGA CCAGTGTACG
GCCAATGGGC AACCCGTCAT TGCGGGCTGT GATACCCCCA ATCCCACGGA TACCACCCAG
AGTGATACCA ACCGTCGGCA GTTGTGGATC TATGATGCGG AAGGGAATAT CGTCGACCAA
CGGAATCCAC GTGGGTTTCG CGAGACCTTT ACCTATGATC GCCTGAACCA GCAACGCAGT
CGAACATACT ACTTCATCGG CATCACCTGC GAGCCGACGA CCATCACCTG CGCGACCACC
CGCCAACAGG CCGATGGAAC GGGCCAAGTA CGCTGGCAGG TTGATGAAAC CGGGATGGAT
GCCACTGATC TGCGCCGGAA TCTCGTAGCC ACGCATGTTG ATCCCGTGGG ACGGGTCGTG
CGGACGGTTG CTGCTGCCGA CAATACGGGC CAATTCGGGA TCGTCACGAC CACCCAGTAT
GATCGTGGGA GCCGCGCCAT TGCCACCACC AATGCCGATG GGCAAACGAC GCTGCTGGGC
TATCGGGGCG ATGATCGGGT CACCACCGTC GTCGAGAATG CAACCGGACC CGTCGCGCCC
ATCGCCGTCA CGACAACGGC CCAATACGAT GTTCATGGCA ATCTGCGCAG CCTCACCGAT
GCCGAGCAGC ATACCCGCCG CTGGCAGTAT GACAGTCGGG GCTTGGTGCT GAGCCAACAG
GATGCCGCCA ATCGCACGAC CACCATGACC TACGATACGG GTGGGCGGAT GCGCACGAAA
ACTGACCCGC GCGGCAGTGC GATGGCCGTC AGCTACACCT ATGACCTGCG TGATCGGGTG
ACCCAGATCA GCTCGCCTGG CCTCGCAACC ACGATTACCA TGCAGTATGA TGCCAGCGGT
CGGCGCACCC AGATGACCGA TGGGAGTGGG ACAACAAGCT GGACGTATGA CCGCGCGGAT
CGCGTCGCGA CGACCACCCA ACCCGTCGTT GGCGGTCTCC AGTATGGCTA TGATGCGACC
GACCAACGGG TTGCACTGAC CTATCCGAGT GGTGGCCCGC AGCTGGGCTA TGGCTATGAT
ATGGATGGGT TGATCAACGC GATTGATACC AATCGTGATG GCACACGCGA TATTAACTAT
ACGCACACCT TCGATGGTCA GCTGACCAGC ATTGAACCGA TTGGCCAGCC ATTTACCCTC
CAATGGACCT ATGATGCGCT TGATCGGGTC ACCAGCATCG ACCAAGGTCG AGCTGCCGCG
CAACGCCAAC CCGATCACCG CACGCGGGGA CAGCGTGCTC AGCCGCAGGA TCTCGTCCCC
ACGACCACGT TTGGCATGCG CTATACCCTC AGTGCCGCTG GTCGCCACGA GGCCATCACC
GAATGGCAAC AGGGGCCGGG CACCGCACTC CCGCTCCCCG TAGGAGCCTA CCGCCAATAT
CTCCCGCTGG TCCTGGATGA GCGGCTGGAT GGGTGGAGTT TGCAGTATGA CGGCCTCAAT
CGTCTGACGT TTGCGACGCT TAGCCGACCA GGCGATGGAA CGCCCGCGCG GGGATTGGTT
GCCGATGAAA CCGAAGGGGC AACCTATGAT CGGGTCGGCA ATCCACTCAC GCGCCAGCGC
AATGGGAGCA CGAGTGGGCC ATGGAGCTAT ACGGCTGCCG ATCAACGCAC CGATTGGACG
TATGATGCGG CGGGGAATGT GCTCAACGAT GGCACCGCGA CGTATACCTA TGATGCCTTG
GGTCGCTTAA CCAGCCGCAC GCAAGGCGGC ACGACCAGCA CCCATACCTA CAATGGCGAT
GGCCTGCTGG TCGCCAGCAC GACGAATGGC GCAACGACCC GATTCCTGTG GGATACCACG
ACCCCGAATG CCCAATTGGT CGGCACCCAG CAAGCGAGCG GCACCACCTG GTATGTCTGG
TCGCCGATGG GCGGCGGGAC GGCGCGGGTG CTCTATAGCC TTGGGCCAAG TGGCCGTCGG
TGGCTGATGA GTGACGGGAT TGGATCGGTC CGCCGCACGC TGAGCGATAG CGGAACCGTG
ATCCAAACTC GGCAGTGGAC CCCGTTTGGG GTGGAAATCG GCGGCACCGC CAGTGCCGGA
CTGGGCTATG CGGGAGAATG GCAAGAAGCC AGCGGCTTGG TGTATCTGCG CGCACGGTGG
TATGATCCGG CAGCGAGTCG CTTCCTGAGC CGTGACCCGT GGGATGGGAT GATCAGCAAT
CCCCAGACCC TCAATCCCTA TGCCTATGCC CATAACCAAC CCACCCGCTT TACCGATCCC
AGCGGGAAAA CGCCCCTCCT TGCGGCGTTG GCAGGAGTTG AAATTGGATT ACCAATCCTA
TCGGGATTGG GGGCTGCGGC GGCAGCCTGT TATCTCAGCA TTGTCTGTGG TGTTGCCGTC
TTAACCGCCG TGGTAGTTGG TGCGTTTGGC CTCGCCTATC TCTGTCAGCA GGGAGTTCTT
TGCCAACCTG ATGCAGCGAC ACAACCATCC GATTATATTG ATACGGATGA AGAACTCTTT
AACAACGTTG ATTGTGACTA TATTGAACTA CCAGACGGGA ATAACGAAGG TCCCGACGGA
TTACCACCAC TCAGGCCTGG CTTAAAATTT GCCGCAATAA TCTTATTGAC TACGGTAGGG
GTAGTCAATC TGAGCCAGAT GATTCCCCGA TCTGCACCAC AATCTACACC TGAATCAGCT
CCTACGGCAC CTATTTATAC TGACACAAAT CCTAAAGATA ATGATAGAGC CAAACAAGTC
TATTACAGAG GCTTAAGTAA CAGGGATTTA AGAGAATTTA ATACCTATGG TATGATTCGG
AGTAAATTCA TTAGAGACGG GGGGAAAGTA GAAGATGCGG CTGATATGAT CAATAATCCC
CAAGCAAGGG AAAATCATCC TTATGGATCG GGAGGGTCAC CATTTGTTTC AGTAACAAAG
AATCCTACAA TGGCGATGAA GTTTGCAACA AAATACTTAG ATGATCCAAC AGATAATGGA
TCTAGCGGCT ACATTCTTAG AATTACGACA ACACGAAAAT ACTATACCTC TCCAACAAGT
TTTGGCCATG AACAAGAAGC TCTCGCACCT ATTCTGATAG GCGGAATATT TGATACCTTA
GAGATCAATC CACAGCAAGG TGTAATTCCA CCTGGATGGA ATTGA
 
Protein sequence
MSLVPRWLAH LLLITLLGSG VPRASMRPTV LANPQPQPAT APVTPNNELP PLPDLPPVAL 
PLVDTPTLRP TIPVTTTAPS PFGPLLTRIS DPAIAPSPTR PRPLEADQTR APSTPAVTPQ
DVSSCTGDAT NPRAWWQRDS TNQYEFLNPS NVDWSHDPTG TTLQRRFFQT DRSVGIVICL
RPTEQIASWV PNATTWTTTW YRSATPTSTV PTIEFGRVTM TYRGVQNGTL LFTARWNTGN
EVPFDQNCTA TANPCATLAP LNIDALDLKR YPGMWRAVVT STTGMTATPP DSVMINLTVT
DYEASPNPIN RATPTTLTTR FQATPPTRNA QGQPSMPFLG YDYDLYTPAG QHIMTLPHPT
PGFDPSTATA SFATSWNNQL TNPTRPITDG TYLVSFMFAF GEPLWLGWSE PAYIQAIWPL
YVGVPATQLR NKGSCEFGCS PASYQVVAGD PVNTATGNFL HSETDLSIPT EGEPLTWTRS
YNSQALTETS VLGPGWTHPF HLRLVLPTSP LGVSNQISLI SPSGNHYTFR QSGSTYVPLP
GVYATLTADA GGYTVSLPDL TRYRFDLLGR VTTVTTSRGA SLQYQYVTSG PYTGLLDRIT
DTSGQRFLAL TYTGESAATA RLASVRDPLN RTVTYGYQDG MLTTVTDVMG RITTYGLTDG
LITTIDHAGV RTLTNVYETT TASVPVGNPS AWLYAAAPAL TAWPPPSQRR VIAQRSGDGV
ETTYTYAADG TTMATRDGNR VYQEKHWYRP DGSLDHISHE TSLGTITTEQ ATFDPVTLSR
TATTDRYGNT LSQTTNARGQ ATSMTDATGG GVTTTYRAAT EAIAPQQPAL VRDSLGRETR
YQYGPGDVLT SVTTGIAPTT PQGRTTTYTY DQRYPGKHWL EAVTDPLGHT TTYTYNAWGQ
ITAVTDAAGA TTTTTYDAVG RVSATTDARG TVTRYTYHPD DTVAMVVRNC TDAAGVPSPT
GPCAAQTAER NLTTTYGYDP RGRQVWERSP LGTYTAQRYT AAGHVAWRTK GLVAAGFPAT
LPTTPPAYDA RWPDQNSTTF YAYDSFGRTT FITETGIVTG SLMLDSSLGW RWTATTQRVT
RLTYDAFGRP WQVMRQYDPQ QPLTGRSDVN LTTTYTYDDT ATPRATMICD PLNRCTRTAY
DALGRVVSVI ERYQDGIPGP DAADRQTVTT YTPAGLVDTV ITDYHDGVYA PATDPLADLK
TAYTYDSWGR ATRVTTAVDP APISGATDVN RTTQRWYDAN DRLYAEQDAM GRITRYRSNA
VGQVIETVTN CTGATATLLT GTCDAFTPSQ PDRNLRVWMQ YTALGQTEAV TTMLDLTARR
TTVYRYDPLG RMTVQVDQCT ANGQPVIAGC DTPNPTDTTQ SDTNRRQLWI YDAEGNIVDQ
RNPRGFRETF TYDRLNQQRS RTYYFIGITC EPTTITCATT RQQADGTGQV RWQVDETGMD
ATDLRRNLVA THVDPVGRVV RTVAAADNTG QFGIVTTTQY DRGSRAIATT NADGQTTLLG
YRGDDRVTTV VENATGPVAP IAVTTTAQYD VHGNLRSLTD AEQHTRRWQY DSRGLVLSQQ
DAANRTTTMT YDTGGRMRTK TDPRGSAMAV SYTYDLRDRV TQISSPGLAT TITMQYDASG
RRTQMTDGSG TTSWTYDRAD RVATTTQPVV GGLQYGYDAT DQRVALTYPS GGPQLGYGYD
MDGLINAIDT NRDGTRDINY THTFDGQLTS IEPIGQPFTL QWTYDALDRV TSIDQGRAAA
QRQPDHRTRG QRAQPQDLVP TTTFGMRYTL SAAGRHEAIT EWQQGPGTAL PLPVGAYRQY
LPLVLDERLD GWSLQYDGLN RLTFATLSRP GDGTPARGLV ADETEGATYD RVGNPLTRQR
NGSTSGPWSY TAADQRTDWT YDAAGNVLND GTATYTYDAL GRLTSRTQGG TTSTHTYNGD
GLLVASTTNG ATTRFLWDTT TPNAQLVGTQ QASGTTWYVW SPMGGGTARV LYSLGPSGRR
WLMSDGIGSV RRTLSDSGTV IQTRQWTPFG VEIGGTASAG LGYAGEWQEA SGLVYLRARW
YDPAASRFLS RDPWDGMISN PQTLNPYAYA HNQPTRFTDP SGKTPLLAAL AGVEIGLPIL
SGLGAAAAAC YLSIVCGVAV LTAVVVGAFG LAYLCQQGVL CQPDAATQPS DYIDTDEELF
NNVDCDYIEL PDGNNEGPDG LPPLRPGLKF AAIILLTTVG VVNLSQMIPR SAPQSTPESA
PTAPIYTDTN PKDNDRAKQV YYRGLSNRDL REFNTYGMIR SKFIRDGGKV EDAADMINNP
QARENHPYGS GGSPFVSVTK NPTMAMKFAT KYLDDPTDNG SSGYILRITT TRKYYTSPTS
FGHEQEALAP ILIGGIFDTL EINPQQGVIP PGWN
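
The sequences above are printed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a block could be cleaned up, checked for GC content, and translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon assignments are those of the standard genetic code, which translation table 11 shares; table 11 differs only in the alternative start codons it allows, and that does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate a cleaned DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first three blocks of the gene sequence above.
first_blocks = "ATGTCGCTTG TGCCACGGTG GCTTGCCCAT"
print(translate(clean_sequence(first_blocks)))  # MSLVPRWLAH
```

The translated prefix matches the first block of the protein sequence. Running `gc_fraction` on the full cleaned gene sequence would be one way to compare against the GC content listed in the record.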