Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1967 |
Symbol | |
ID | 5733856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2404483 |
End bp | 2411943 |
Gene Length | 7461 bp |
Protein Length | 2486 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279111 |
Product | YD repeat-containing protein |
Protein accession | YP_001544738 |
Protein GI | 159898491 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCCTA TTCGTCGTCG CCGTTGGATA CCGCTCGCAC TCTTGCTGAT AGTAATTGTG TCGTTATTGC CGCTTCAATC AACATCAGCG ATTCCCATGC TTCCACAATC GGATGCCCTG ACCAAGCGTG TTCCAGAGAT TCTTGATGAA CCACCCGTCA ATAACCGCCC TTCCCCAGCT CAATCAGCAG TAACGCCCAA AGAAGAAGCT CAGAATATCG CACCCGTCGC TGAATCCCAG CATTCGTTTA GCGGGATTGA CCTGACCTTC ACGGGCGACA CTATCAGCCA AGTCCCAGTT GCTGCAACCA ATGCGGCAGT CTATGTAGCC CAAGCAACGG TAAACGACCT TTACATCGTT AATGGTCAAC CCCACTATAA TTCAGATAAC CTCAGCTATT ATGAAGATAC TGGGATTTTC ATGAACCTTC CCCGTGATAG TTTCACAGTG AATGTTGAAG TTGCAGTCGG GAATATAGGT TTTATTTATA TCCAATCTGT CAATAATCCT CAAATATATC AAAAAGCTGC GTTCACCACA CCCCAAGGCA ATCCTCATGA TATGCATTTT ACGATCGGAC CTTCGATCCC TCGTAATCAA CCAATTCATA TCTATGTTGG GTGTTTCAAC CCTCAGCCAG GGGGATATAA TAACTGTCGC TATACGAATT TCCAATTCTA CCTTGATAGC GACGTTTTGA CACGCATTCC TGATCCAGCG TATGATCATC CAAAAAAACT TCTCCCCAGC TATATTAATC AAGTAAATAC AGGTAAATAC TTACCCGCAG ATGGAGCAGG CAATCCCTTC TTCATGTACG GCTATTTACC ACCAGTTCAA GGAATTGACG AAAAATTTCT TTATCAAACT AAGGCTTTTA ATTTTCCAGC CCTCCAACCA AACGATACGA TTACGATTAA CTTTCGCTAT TCGGCATTCT CATGGCCAGG AACTACTCCA GCTAATAGTT TTGGCTCACA TATCAACCTA TTCTTTAGCG ATGCAAACAG TTCTACCTAT GATGTGTTCG GCTGGGACAC ATTGTCGCGA GATCTAACTA ATCCTATAAA CATCCCCATA CCAGGCTTTT CTACCTATAG CTATATTCCG TGGCGTAGTG GATCGATTAC CCTGAACCAA AATATATATT CTCAAATCTC CAATAAAACT ATTCGGATTA ACCTTGCGCC TCAATCAGCA GACAGCCAGT ATTATCCGTA TTTAGCAGGG ATTGATTCGA TTCAATTCTA TCGCAACGGG AAATTAATGA ATTTCTCCTC GATTGAGGAT CGCATTCCCG CCGACCAAAA TGGTGGGAGT TGTGCCCCAT GTACGGCTGC GGGCGAAAGC ACCATGATTG TTGGCGATCC GGTCAATACC TTATCGGGAG CCTATATCGA GCATGGGGTT GATCAGCAGA TTCCCACCGG TGGCACGGCT CTCAGTCTGC AACGTACCTA CGTCTCAACC TTGGCCAATA GTAGCCTCTA TCCACAAAGT TCGCTTGGGC TTGGGTGGCG TTTTAATTAT GCCGAAAGCT TGACTTTGCC GGTAGAAATT ACGGGCGTAA CAACAGTTGG CGCTGAATCC AACACCGTCA TTTACGAAGC AGCCGATGGC AATCGCTATC GCTTCAAGCG CCATGGCAAT GATTTTGTAG CAGCACGTGG TATTCGCTTA ACGTTGACCA AAAATGCCAC TAGCTATACC TTGCAAGCTG CCGATCAATC AGCTAAAACT TTTGATCTGC AAGGCCGTCT CCTCCAACTA CGCGATGTCT ATGGCCGCAC CCAAACGATT ACCCTCGGAA CCAGCGGAAT TCAAACCGGA CTGCCGGTCG AAGTGCGTGA CGATTTGAGC CAACGCAAGC TGCGGCTTGA ATATACCACC ACGAGCGGTG TTGTTCGGTT GTATCGCCTG CTCAATGATT TGAATCAGGC AACCGTCTAT GAGTATACCC AAGGCCGTTT GGAATGGGTT ACCTCGCCCC AAAGCATCCT CAGCCGCTAT ACCTATGATC CAACAAGTTT GCTCATGAGC AGCGTTACCC GCAACGCAAC CAGCAAAAGT CCTTTACCAA CCGCCGACCT TGTGATGAGT TATACCAATG GGCGGGTCAC TCAGCAAACT GCTCCTGCCG ATAATACGGT CTGGGCATAT ACCTATACTG GAACGGGCGC AAATCAAGCC CAAACCACGA CCATGACCGT CAGTCGTGGC GGCATTGTGC TTGATGTCCA ACGCTCACAT TATCGTGCGG ATGGGACATT CGCTTGGCAA GAACGCAACG ATGCCTTGCT TGAATATGTG GCTAATGACC GAATGCTCGC CCCAACTGCA GGGGTAAAGG CTGATAAAAG CATTGCGCTA CATCATAATA ATCCTAAAGG GCAACCAGTT GAAGTGTATG CAGGCGCAAT TGAAGGCGCT ATACTTGGGA CAACTCAACG CACGTTTGCG ATTAGTTATA CGCCCACTGG TTATCCGGCA ATTGTCACCA ACACCAATGG GCTAGAAACC TACACGACCT ATAATAGTGC TAATCAGCCG CTCACGACCA AAGCCGGTAC AGGAACCCTC GTTCCAACCC AAACCTGGAC CTATGACGCG GCAACCAAGC AACCGCAAAC GATCACCAGT CCTGACGGGA TTGTCACCCG CTATACCTAT ACCGCTGCTG GTCAAGTGGC CTCGACAGAA GTGGGATACG GCACCAGCAG TGTTCAAAAG ACCACCTATC AATACGATAG TCTTGGTCGA CAAACCCATA TGACCCTAGG TGATGGTCTG CCCAATGCAA CAACAACGGT TACCGAGTAT CGCCCCGATA ATCAAATTGC GAAAATCACC CGCAATTATG TGGCGGGCAA TACCACTGAT CCGCGTAAAA ATGTGGTGAC CGAGTATGGC TACAATGCCA AAGGGCAATT GATTTGGACA AAGACTCCCG ATGGTCGCTA TCGTGGCGTG ACCAGCTATG ATGCTTTAGG TCGCGTTCGT TGGGTCGCCG ATAATACGGT CAACCCCAGC ACTGGAGCAA TTGCGATTGC TGATACCTCC AGCACCAGCC TGCCACCAAG CTTTAGCCCA CAACGCCCCG ATGCCAATAT TGTCACCATG TATGGCTACG ATTGGCATGG ACGCACCACC CTGATTACCC AAACAGGGAT TGTCACCGGC ACATTTAATG TCGCAACCTT ACAATGGCAA TCCAGTACTT CACGGGTCAC TCGCATCGAG TATGATCAAT TAAGCCGACC TGTCACAACA ACCTACAATT ACCGACCTGA TATCTATGCG GGTCAATTCA ACACGAATCA TCCCGATGTC AATGTTCCAA CCTATACCTA TTACGATGGG GCAGGGAAAG TTACTTGGAC ACGCGATGGA TTACGACGTT GGAATCATTT TGAATATGAT CAGCTTGGCC GTCCAGTAAC GACGACGCTC AATTATGAAA ATGGCGACCC ACTAACGGTT GATCCAGTCA ATGCAACCTG GGCAAGCACA AACGACACCG ATATTATCTC GATTACCCAC TATGATCAGG GTGGACAGAT CGACCATACC ATTCGTAATT TCGTCGATGG GGTGCGCGAT ACCACGATTC CAGATAGTTC TATCCCATGG CAGATCACTG ATGTTGTCAC GGATGTGGTT ACACGCTACC ATTACGACCA GGCTGGTCGG ATGGATCAAA CGATCTCAAA TTATGTTGCT GGAGCAACTG ACCCTGAGTT TAACCGCGTT GCAAACACTA TTTACGATCA AGCAACTGGA CGAACGCTTG GAGTAACCGA CGCATGGAAT CAATATACGG TCTATGAATA TGATCAACTT GGACGATCAG TCAGCAGCGT AACGAATTGT CGCACCAGCA GCGGTGTTCC AACGTACAAT CGCAACACAT GTGCCAGTAC AACCTCGCAG CGCAATCTTC CCTCACCTTT ATCGGTCTAC GATCAGCTAG GGCGGCTAAG CCAAACCTTG AGCGTTGATG CGATAGGTGG GTTAGGCTTT AATGGACCAC CCACCCAATA TAGCTATGAT GGGCTTGGAC AGTTGGTCGA AACTATTGTC AATAGCCAAC CAGGCCAAGC TGCATCCGCA ACAGTCAATG TCAAATCGAC CACCACCTAT CTCGATGCTG CCGGCAGTCG CTGGAATGAA ACCGACCCGA CCGGCAAAAT AACCCAATAC GAAACCGATG GGTTTGGACA AGTCAACAAA GTTATCGATC CAGCTGGATT GATCACGCTG AGCGGACCAC GCTGGACAAA AACACCCGAT GGTCAAGTTC GTGTGGCGAT GATTGATGGC CTTGGCCGCA CAATCAAAAC GGTGAGCAAT TATCAGGATG GGGTCTACGT TCCAGCAACC GATACCACCA CCCACGATAT CATCACCCTC ACCCGCTATG ATGTTGGTGG TCGCCAAGTT GCCGTGACCA ATAGCCTCAA TATTCGAACG CGCTACGATT ATGACTTACG CGATAACTTA ATTCGGATTA TTGAGAATGA TCTTGGCACA TGTACCGCCA GCGATACGAA TGACTGTCAA GTAACCACCG AATATCGCTA TGATCGGGTC GGCAACCTAA TCAAAACGAT CGATGCCCGC GGCTACAGTA CCACCAAGAG TTACGATAGC CTTGGGCGGG TACGGCGCAC AACCGATGGC TTAAACCGCC ATACACTCTT GACCTATAAC CAAGATGATA CTGTAGCAAC CATTGCCCCA GCAACAGGCA ATCCAATCTC CTATAGTTAC GATGAGGTGG GTCATCAAAT CCAAGCAACA GGCTGGGATT CAACCGCCGT TCAACAATGG ACCTACGACC TCGCAGGACG GTTATCATCA GTCTATGATG CATCGGCACA AGATGCCGCC GCCGGACGAA CAGGCACCAT CAGCTACAAA TATGATGTCC TGAATCGGAT ATCGAGCGTC GCCCAATCAC TTGAAGGTGA TCCTCAAGGC AATTGGACGC TGACCTATGG CTATGATGCA GCCGGACGTA CCACAGCAAT CGGGGGAACA AGCTATTCCT ATGATACCGC TGGACGTTTA AGCTCGGTTA TTCGAGGTGG TTCACCCTTA GCACAATATG CGTATCAGGG GTCAACAGGT CGCGTAGCAA GTATCACCCG CTTAGAGGCC GGAAACAATC GACTGATTGA AAGCCTTGGC TATGATACCC GTGGCCGCGT AACCAGTCGC AGCATCACCG GAGCAACCAC GGCTGCGGCC ATGCCAACGC CAACAGCACC TCAGTTGGCG CAATTCACCT ATGCCTATGA TCGCGCTGAT CAGCCAACGA GTTTTACCGA AACCCAACTC AATAGCGCGG GCACAGCCAC AACCAATGAA ACGCATAGCT ATACCTACGA CAAACTCAAT CGCCTGATCA GCGAAAATGC CAGTGGCACA ACCACCACTT TTCGCTTTGA TGCAGCTGGT AATCGGATTA AAGTTAACAG CCAAAACTCC AGTTACAATG CCGCCAATGA GCGAGTAGGA TTCGCCTATG ATGTGTATGG CAATGTCACT GGTGACGGCC AAAATGTCTT TAGTTATGAT GGCTTCAACC GCCTCAAGAG CGTCATCAAG GGAGCGAATG TCTATAGTTA TCGCTACCAT GAGGAAACGC TGACCAGCCG GTTCGTCAAT GGCACGCGGA CGCAAGCCTT TAATTACGAT CGGGTTGGAA CGTCATTAAG TAGCATCCTT CGGATTCATA CGATCCAACC AACCGATACG CTGTATACCA CCTATATCAC CGGGCTTGGT GGCGATATCA TTCAATCGGA AGCAACCTTT AATGGCGCAC CACAAACCAA TGGGAAGCTC TTCCTAATTA GTGATAACCA AAGCACAGTA CGGATGTTGG TGAATGCTAG CAATGGCGCA CGCACTAAAC AAGATAGTGA TGCTTGGGGC AATTTTGTGC CTGCGGCAGG CCAAACCCTA GCACCATCGT CAATTCGCTA CACCGGCGAA TACACCGACC GTGACACAGG CTTGGTGTTC TTACGGGCAC GCTGGTACAA CCCTGCTAGC GGAACCTTGT TGAGCAAAGA TCCATTTGCT GGTTTCGCCA ATCAACCTCA ATCACAACAT CCCTACATCT ACGCAGGCAA TAACCCAACC ACTAATAGCG ACCCAACCGG GCGCACCTGT GAAGGCTGTC AACCAGCAGG GGTAACCCGC GATCAATGGC ATCAATATGT TGATCTGTAT GATGTAACCT ATTCAGGGTT TGTAGGGGCG TTGATCAATC ATCCATTTGA TCGGATGGGT GATGCAACGG TCTTCGATAA CTATGCCCAT GCGACTCGCT CATCCCTCGA TTTCATCTAC TATCTCAAGA ACTTCTTATG GGGTTGTTCG ATCCAACAAT ACGGAGCCTT AGAGAATCTA TGGTCAAATG CCGAAACATG GTCAGAACAC TTTTACCATG ATCGGCTGCC ATCGGTTCCA TTGCATCGCT ATGATAGTAT CGAAGATGCG ATGGATGCCT TGGGTGATGA GGCATATCTC AACTATTTCA TTGAAACAAA TCTTGCTGAT GGCAATCATC TTGATCCAAT TGACTCGATA AGCCCAGGGC TTGCCGATGT CCCGGTAGTC CCATCTACAC CACGGCAGAA TAATCCATCG ACCGATGTTG ATGATCCGGC TCCAGCGAAA TCGCCAAAAC CTAGTAATGA CCCCGATGTC GATGTTGACA ATCCAGCCCC AAGCATATCA CCCAAAGTCG GAGAATCTTG TAGCTTTGAT GCGACAACGT TGGTTGCGAC CGATGAAGGG CTGGTTCCAA TTGCCACGAT TCAAGTTGGC GATCTGGTCT TAGCCTACGA CGAAGCTACT GGCACAACTG ACTACTACAC CGTTACCGCC AGCTTTGTGC ATAGTGATCC AGTCTTAGTT GACGTTGCCA TTGACAATGA ATGGATCAAC ACAACTCCCG AGCATCCCTT CTTCACCGCC GATGGCTGGA CAGATGCTGA AAACCTTGAA CTTAGCGACT GGGTTGCCAG CGCTGATGGC GAGTGGGGAA AAATCACCGC GTTGCGCCTA CGTGGCGACC AACGCTTTAT GTACAACCTT ACGGTTGCCG AAGCTCATAC CTTCTTTGTT GGTGATGGTC GCTGGTTAGT CCATAATGTT TGTATTGATG GTGTCGAATA TCCTGTCGAT AAACCAACTA ACCAAACCAA AAATGACTCT ATTTGGTATG ATACTCAAGC GCAGGCACGC CAAATGGCAC GTCAAAAAAT GGGACATAAT CCTGTTGAAA TTGAACCAAA TAAGCTACGG CGCAGGGATG GTACATGGCA ATATCGTGCA AAACCAGGAG ATTTAGCAGG TGATAACCAA GGCCCACACA TTCATTTGGA AAAACTAGAT CCCAAGACAG GCGATGTACT TATCAACCTA CACTTACGTT GGAGAAAATA A
|
Protein sequence | MLPIRRRRWI PLALLLIVIV SLLPLQSTSA IPMLPQSDAL TKRVPEILDE PPVNNRPSPA QSAVTPKEEA QNIAPVAESQ HSFSGIDLTF TGDTISQVPV AATNAAVYVA QATVNDLYIV NGQPHYNSDN LSYYEDTGIF MNLPRDSFTV NVEVAVGNIG FIYIQSVNNP QIYQKAAFTT PQGNPHDMHF TIGPSIPRNQ PIHIYVGCFN PQPGGYNNCR YTNFQFYLDS DVLTRIPDPA YDHPKKLLPS YINQVNTGKY LPADGAGNPF FMYGYLPPVQ GIDEKFLYQT KAFNFPALQP NDTITINFRY SAFSWPGTTP ANSFGSHINL FFSDANSSTY DVFGWDTLSR DLTNPINIPI PGFSTYSYIP WRSGSITLNQ NIYSQISNKT IRINLAPQSA DSQYYPYLAG IDSIQFYRNG KLMNFSSIED RIPADQNGGS CAPCTAAGES TMIVGDPVNT LSGAYIEHGV DQQIPTGGTA LSLQRTYVST LANSSLYPQS SLGLGWRFNY AESLTLPVEI TGVTTVGAES NTVIYEAADG NRYRFKRHGN DFVAARGIRL TLTKNATSYT LQAADQSAKT FDLQGRLLQL RDVYGRTQTI TLGTSGIQTG LPVEVRDDLS QRKLRLEYTT TSGVVRLYRL LNDLNQATVY EYTQGRLEWV TSPQSILSRY TYDPTSLLMS SVTRNATSKS PLPTADLVMS YTNGRVTQQT APADNTVWAY TYTGTGANQA QTTTMTVSRG GIVLDVQRSH YRADGTFAWQ ERNDALLEYV ANDRMLAPTA GVKADKSIAL HHNNPKGQPV EVYAGAIEGA ILGTTQRTFA ISYTPTGYPA IVTNTNGLET YTTYNSANQP LTTKAGTGTL VPTQTWTYDA ATKQPQTITS PDGIVTRYTY TAAGQVASTE VGYGTSSVQK TTYQYDSLGR QTHMTLGDGL PNATTTVTEY RPDNQIAKIT RNYVAGNTTD PRKNVVTEYG YNAKGQLIWT KTPDGRYRGV TSYDALGRVR WVADNTVNPS TGAIAIADTS STSLPPSFSP QRPDANIVTM YGYDWHGRTT LITQTGIVTG TFNVATLQWQ SSTSRVTRIE YDQLSRPVTT TYNYRPDIYA GQFNTNHPDV NVPTYTYYDG AGKVTWTRDG LRRWNHFEYD QLGRPVTTTL NYENGDPLTV DPVNATWAST NDTDIISITH YDQGGQIDHT IRNFVDGVRD TTIPDSSIPW QITDVVTDVV TRYHYDQAGR MDQTISNYVA GATDPEFNRV ANTIYDQATG RTLGVTDAWN QYTVYEYDQL GRSVSSVTNC RTSSGVPTYN RNTCASTTSQ RNLPSPLSVY DQLGRLSQTL SVDAIGGLGF NGPPTQYSYD GLGQLVETIV NSQPGQAASA TVNVKSTTTY LDAAGSRWNE TDPTGKITQY ETDGFGQVNK VIDPAGLITL SGPRWTKTPD GQVRVAMIDG LGRTIKTVSN YQDGVYVPAT DTTTHDIITL TRYDVGGRQV AVTNSLNIRT RYDYDLRDNL IRIIENDLGT CTASDTNDCQ VTTEYRYDRV GNLIKTIDAR GYSTTKSYDS LGRVRRTTDG LNRHTLLTYN QDDTVATIAP ATGNPISYSY DEVGHQIQAT GWDSTAVQQW TYDLAGRLSS VYDASAQDAA AGRTGTISYK YDVLNRISSV AQSLEGDPQG NWTLTYGYDA AGRTTAIGGT SYSYDTAGRL SSVIRGGSPL AQYAYQGSTG RVASITRLEA GNNRLIESLG YDTRGRVTSR SITGATTAAA MPTPTAPQLA QFTYAYDRAD QPTSFTETQL NSAGTATTNE THSYTYDKLN RLISENASGT TTTFRFDAAG NRIKVNSQNS SYNAANERVG FAYDVYGNVT GDGQNVFSYD GFNRLKSVIK GANVYSYRYH EETLTSRFVN GTRTQAFNYD RVGTSLSSIL RIHTIQPTDT LYTTYITGLG GDIIQSEATF NGAPQTNGKL FLISDNQSTV RMLVNASNGA RTKQDSDAWG NFVPAAGQTL APSSIRYTGE YTDRDTGLVF LRARWYNPAS GTLLSKDPFA GFANQPQSQH PYIYAGNNPT TNSDPTGRTC EGCQPAGVTR DQWHQYVDLY DVTYSGFVGA LINHPFDRMG DATVFDNYAH ATRSSLDFIY YLKNFLWGCS IQQYGALENL WSNAETWSEH FYHDRLPSVP LHRYDSIEDA MDALGDEAYL NYFIETNLAD GNHLDPIDSI SPGLADVPVV PSTPRQNNPS TDVDDPAPAK SPKPSNDPDV DVDNPAPSIS PKVGESCSFD ATTLVATDEG LVPIATIQVG DLVLAYDEAT GTTDYYTVTA SFVHSDPVLV DVAIDNEWIN TTPEHPFFTA DGWTDAENLE LSDWVASADG EWGKITALRL RGDQRFMYNL TVAEAHTFFV GDGRWLVHNV CIDGVEYPVD KPTNQTKNDS IWYDTQAQAR QMARQKMGHN PVEIEPNKLR RRDGTWQYRA KPGDLAGDNQ GPHIHLEKLD PKTGDVLINL HLRWRK
|
| |