Gene Haur_1967 details


Gene Information

Locus tag: Haur_1967
Symbol:
ID: 5733856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2404483
End bp: 2411943
Gene Length: 7461 bp
Protein Length: 2486 aa
Translation table: 11
GC content: 49%
IMG OID: 641279111
Product: YD repeat-containing protein
Protein accession: YP_001544738
Protein GI: 159898491
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3209] Rhs family protein
TIGRFAM ID: [TIGR01643] YD repeat (two copies)
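
The coordinate and length fields above are consistent with each other, which a few lines of Python can confirm. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(2404483, 2411943)
print(length)                     # 7461
print(protein_length_aa(length))  # 2486
```

Both results match the Gene Length and Protein Length fields of the record.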


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGCCTA TTCGTCGTCG CCGTTGGATA CCGCTCGCAC TCTTGCTGAT AGTAATTGTG 
TCGTTATTGC CGCTTCAATC AACATCAGCG ATTCCCATGC TTCCACAATC GGATGCCCTG
ACCAAGCGTG TTCCAGAGAT TCTTGATGAA CCACCCGTCA ATAACCGCCC TTCCCCAGCT
CAATCAGCAG TAACGCCCAA AGAAGAAGCT CAGAATATCG CACCCGTCGC TGAATCCCAG
CATTCGTTTA GCGGGATTGA CCTGACCTTC ACGGGCGACA CTATCAGCCA AGTCCCAGTT
GCTGCAACCA ATGCGGCAGT CTATGTAGCC CAAGCAACGG TAAACGACCT TTACATCGTT
AATGGTCAAC CCCACTATAA TTCAGATAAC CTCAGCTATT ATGAAGATAC TGGGATTTTC
ATGAACCTTC CCCGTGATAG TTTCACAGTG AATGTTGAAG TTGCAGTCGG GAATATAGGT
TTTATTTATA TCCAATCTGT CAATAATCCT CAAATATATC AAAAAGCTGC GTTCACCACA
CCCCAAGGCA ATCCTCATGA TATGCATTTT ACGATCGGAC CTTCGATCCC TCGTAATCAA
CCAATTCATA TCTATGTTGG GTGTTTCAAC CCTCAGCCAG GGGGATATAA TAACTGTCGC
TATACGAATT TCCAATTCTA CCTTGATAGC GACGTTTTGA CACGCATTCC TGATCCAGCG
TATGATCATC CAAAAAAACT TCTCCCCAGC TATATTAATC AAGTAAATAC AGGTAAATAC
TTACCCGCAG ATGGAGCAGG CAATCCCTTC TTCATGTACG GCTATTTACC ACCAGTTCAA
GGAATTGACG AAAAATTTCT TTATCAAACT AAGGCTTTTA ATTTTCCAGC CCTCCAACCA
AACGATACGA TTACGATTAA CTTTCGCTAT TCGGCATTCT CATGGCCAGG AACTACTCCA
GCTAATAGTT TTGGCTCACA TATCAACCTA TTCTTTAGCG ATGCAAACAG TTCTACCTAT
GATGTGTTCG GCTGGGACAC ATTGTCGCGA GATCTAACTA ATCCTATAAA CATCCCCATA
CCAGGCTTTT CTACCTATAG CTATATTCCG TGGCGTAGTG GATCGATTAC CCTGAACCAA
AATATATATT CTCAAATCTC CAATAAAACT ATTCGGATTA ACCTTGCGCC TCAATCAGCA
GACAGCCAGT ATTATCCGTA TTTAGCAGGG ATTGATTCGA TTCAATTCTA TCGCAACGGG
AAATTAATGA ATTTCTCCTC GATTGAGGAT CGCATTCCCG CCGACCAAAA TGGTGGGAGT
TGTGCCCCAT GTACGGCTGC GGGCGAAAGC ACCATGATTG TTGGCGATCC GGTCAATACC
TTATCGGGAG CCTATATCGA GCATGGGGTT GATCAGCAGA TTCCCACCGG TGGCACGGCT
CTCAGTCTGC AACGTACCTA CGTCTCAACC TTGGCCAATA GTAGCCTCTA TCCACAAAGT
TCGCTTGGGC TTGGGTGGCG TTTTAATTAT GCCGAAAGCT TGACTTTGCC GGTAGAAATT
ACGGGCGTAA CAACAGTTGG CGCTGAATCC AACACCGTCA TTTACGAAGC AGCCGATGGC
AATCGCTATC GCTTCAAGCG CCATGGCAAT GATTTTGTAG CAGCACGTGG TATTCGCTTA
ACGTTGACCA AAAATGCCAC TAGCTATACC TTGCAAGCTG CCGATCAATC AGCTAAAACT
TTTGATCTGC AAGGCCGTCT CCTCCAACTA CGCGATGTCT ATGGCCGCAC CCAAACGATT
ACCCTCGGAA CCAGCGGAAT TCAAACCGGA CTGCCGGTCG AAGTGCGTGA CGATTTGAGC
CAACGCAAGC TGCGGCTTGA ATATACCACC ACGAGCGGTG TTGTTCGGTT GTATCGCCTG
CTCAATGATT TGAATCAGGC AACCGTCTAT GAGTATACCC AAGGCCGTTT GGAATGGGTT
ACCTCGCCCC AAAGCATCCT CAGCCGCTAT ACCTATGATC CAACAAGTTT GCTCATGAGC
AGCGTTACCC GCAACGCAAC CAGCAAAAGT CCTTTACCAA CCGCCGACCT TGTGATGAGT
TATACCAATG GGCGGGTCAC TCAGCAAACT GCTCCTGCCG ATAATACGGT CTGGGCATAT
ACCTATACTG GAACGGGCGC AAATCAAGCC CAAACCACGA CCATGACCGT CAGTCGTGGC
GGCATTGTGC TTGATGTCCA ACGCTCACAT TATCGTGCGG ATGGGACATT CGCTTGGCAA
GAACGCAACG ATGCCTTGCT TGAATATGTG GCTAATGACC GAATGCTCGC CCCAACTGCA
GGGGTAAAGG CTGATAAAAG CATTGCGCTA CATCATAATA ATCCTAAAGG GCAACCAGTT
GAAGTGTATG CAGGCGCAAT TGAAGGCGCT ATACTTGGGA CAACTCAACG CACGTTTGCG
ATTAGTTATA CGCCCACTGG TTATCCGGCA ATTGTCACCA ACACCAATGG GCTAGAAACC
TACACGACCT ATAATAGTGC TAATCAGCCG CTCACGACCA AAGCCGGTAC AGGAACCCTC
GTTCCAACCC AAACCTGGAC CTATGACGCG GCAACCAAGC AACCGCAAAC GATCACCAGT
CCTGACGGGA TTGTCACCCG CTATACCTAT ACCGCTGCTG GTCAAGTGGC CTCGACAGAA
GTGGGATACG GCACCAGCAG TGTTCAAAAG ACCACCTATC AATACGATAG TCTTGGTCGA
CAAACCCATA TGACCCTAGG TGATGGTCTG CCCAATGCAA CAACAACGGT TACCGAGTAT
CGCCCCGATA ATCAAATTGC GAAAATCACC CGCAATTATG TGGCGGGCAA TACCACTGAT
CCGCGTAAAA ATGTGGTGAC CGAGTATGGC TACAATGCCA AAGGGCAATT GATTTGGACA
AAGACTCCCG ATGGTCGCTA TCGTGGCGTG ACCAGCTATG ATGCTTTAGG TCGCGTTCGT
TGGGTCGCCG ATAATACGGT CAACCCCAGC ACTGGAGCAA TTGCGATTGC TGATACCTCC
AGCACCAGCC TGCCACCAAG CTTTAGCCCA CAACGCCCCG ATGCCAATAT TGTCACCATG
TATGGCTACG ATTGGCATGG ACGCACCACC CTGATTACCC AAACAGGGAT TGTCACCGGC
ACATTTAATG TCGCAACCTT ACAATGGCAA TCCAGTACTT CACGGGTCAC TCGCATCGAG
TATGATCAAT TAAGCCGACC TGTCACAACA ACCTACAATT ACCGACCTGA TATCTATGCG
GGTCAATTCA ACACGAATCA TCCCGATGTC AATGTTCCAA CCTATACCTA TTACGATGGG
GCAGGGAAAG TTACTTGGAC ACGCGATGGA TTACGACGTT GGAATCATTT TGAATATGAT
CAGCTTGGCC GTCCAGTAAC GACGACGCTC AATTATGAAA ATGGCGACCC ACTAACGGTT
GATCCAGTCA ATGCAACCTG GGCAAGCACA AACGACACCG ATATTATCTC GATTACCCAC
TATGATCAGG GTGGACAGAT CGACCATACC ATTCGTAATT TCGTCGATGG GGTGCGCGAT
ACCACGATTC CAGATAGTTC TATCCCATGG CAGATCACTG ATGTTGTCAC GGATGTGGTT
ACACGCTACC ATTACGACCA GGCTGGTCGG ATGGATCAAA CGATCTCAAA TTATGTTGCT
GGAGCAACTG ACCCTGAGTT TAACCGCGTT GCAAACACTA TTTACGATCA AGCAACTGGA
CGAACGCTTG GAGTAACCGA CGCATGGAAT CAATATACGG TCTATGAATA TGATCAACTT
GGACGATCAG TCAGCAGCGT AACGAATTGT CGCACCAGCA GCGGTGTTCC AACGTACAAT
CGCAACACAT GTGCCAGTAC AACCTCGCAG CGCAATCTTC CCTCACCTTT ATCGGTCTAC
GATCAGCTAG GGCGGCTAAG CCAAACCTTG AGCGTTGATG CGATAGGTGG GTTAGGCTTT
AATGGACCAC CCACCCAATA TAGCTATGAT GGGCTTGGAC AGTTGGTCGA AACTATTGTC
AATAGCCAAC CAGGCCAAGC TGCATCCGCA ACAGTCAATG TCAAATCGAC CACCACCTAT
CTCGATGCTG CCGGCAGTCG CTGGAATGAA ACCGACCCGA CCGGCAAAAT AACCCAATAC
GAAACCGATG GGTTTGGACA AGTCAACAAA GTTATCGATC CAGCTGGATT GATCACGCTG
AGCGGACCAC GCTGGACAAA AACACCCGAT GGTCAAGTTC GTGTGGCGAT GATTGATGGC
CTTGGCCGCA CAATCAAAAC GGTGAGCAAT TATCAGGATG GGGTCTACGT TCCAGCAACC
GATACCACCA CCCACGATAT CATCACCCTC ACCCGCTATG ATGTTGGTGG TCGCCAAGTT
GCCGTGACCA ATAGCCTCAA TATTCGAACG CGCTACGATT ATGACTTACG CGATAACTTA
ATTCGGATTA TTGAGAATGA TCTTGGCACA TGTACCGCCA GCGATACGAA TGACTGTCAA
GTAACCACCG AATATCGCTA TGATCGGGTC GGCAACCTAA TCAAAACGAT CGATGCCCGC
GGCTACAGTA CCACCAAGAG TTACGATAGC CTTGGGCGGG TACGGCGCAC AACCGATGGC
TTAAACCGCC ATACACTCTT GACCTATAAC CAAGATGATA CTGTAGCAAC CATTGCCCCA
GCAACAGGCA ATCCAATCTC CTATAGTTAC GATGAGGTGG GTCATCAAAT CCAAGCAACA
GGCTGGGATT CAACCGCCGT TCAACAATGG ACCTACGACC TCGCAGGACG GTTATCATCA
GTCTATGATG CATCGGCACA AGATGCCGCC GCCGGACGAA CAGGCACCAT CAGCTACAAA
TATGATGTCC TGAATCGGAT ATCGAGCGTC GCCCAATCAC TTGAAGGTGA TCCTCAAGGC
AATTGGACGC TGACCTATGG CTATGATGCA GCCGGACGTA CCACAGCAAT CGGGGGAACA
AGCTATTCCT ATGATACCGC TGGACGTTTA AGCTCGGTTA TTCGAGGTGG TTCACCCTTA
GCACAATATG CGTATCAGGG GTCAACAGGT CGCGTAGCAA GTATCACCCG CTTAGAGGCC
GGAAACAATC GACTGATTGA AAGCCTTGGC TATGATACCC GTGGCCGCGT AACCAGTCGC
AGCATCACCG GAGCAACCAC GGCTGCGGCC ATGCCAACGC CAACAGCACC TCAGTTGGCG
CAATTCACCT ATGCCTATGA TCGCGCTGAT CAGCCAACGA GTTTTACCGA AACCCAACTC
AATAGCGCGG GCACAGCCAC AACCAATGAA ACGCATAGCT ATACCTACGA CAAACTCAAT
CGCCTGATCA GCGAAAATGC CAGTGGCACA ACCACCACTT TTCGCTTTGA TGCAGCTGGT
AATCGGATTA AAGTTAACAG CCAAAACTCC AGTTACAATG CCGCCAATGA GCGAGTAGGA
TTCGCCTATG ATGTGTATGG CAATGTCACT GGTGACGGCC AAAATGTCTT TAGTTATGAT
GGCTTCAACC GCCTCAAGAG CGTCATCAAG GGAGCGAATG TCTATAGTTA TCGCTACCAT
GAGGAAACGC TGACCAGCCG GTTCGTCAAT GGCACGCGGA CGCAAGCCTT TAATTACGAT
CGGGTTGGAA CGTCATTAAG TAGCATCCTT CGGATTCATA CGATCCAACC AACCGATACG
CTGTATACCA CCTATATCAC CGGGCTTGGT GGCGATATCA TTCAATCGGA AGCAACCTTT
AATGGCGCAC CACAAACCAA TGGGAAGCTC TTCCTAATTA GTGATAACCA AAGCACAGTA
CGGATGTTGG TGAATGCTAG CAATGGCGCA CGCACTAAAC AAGATAGTGA TGCTTGGGGC
AATTTTGTGC CTGCGGCAGG CCAAACCCTA GCACCATCGT CAATTCGCTA CACCGGCGAA
TACACCGACC GTGACACAGG CTTGGTGTTC TTACGGGCAC GCTGGTACAA CCCTGCTAGC
GGAACCTTGT TGAGCAAAGA TCCATTTGCT GGTTTCGCCA ATCAACCTCA ATCACAACAT
CCCTACATCT ACGCAGGCAA TAACCCAACC ACTAATAGCG ACCCAACCGG GCGCACCTGT
GAAGGCTGTC AACCAGCAGG GGTAACCCGC GATCAATGGC ATCAATATGT TGATCTGTAT
GATGTAACCT ATTCAGGGTT TGTAGGGGCG TTGATCAATC ATCCATTTGA TCGGATGGGT
GATGCAACGG TCTTCGATAA CTATGCCCAT GCGACTCGCT CATCCCTCGA TTTCATCTAC
TATCTCAAGA ACTTCTTATG GGGTTGTTCG ATCCAACAAT ACGGAGCCTT AGAGAATCTA
TGGTCAAATG CCGAAACATG GTCAGAACAC TTTTACCATG ATCGGCTGCC ATCGGTTCCA
TTGCATCGCT ATGATAGTAT CGAAGATGCG ATGGATGCCT TGGGTGATGA GGCATATCTC
AACTATTTCA TTGAAACAAA TCTTGCTGAT GGCAATCATC TTGATCCAAT TGACTCGATA
AGCCCAGGGC TTGCCGATGT CCCGGTAGTC CCATCTACAC CACGGCAGAA TAATCCATCG
ACCGATGTTG ATGATCCGGC TCCAGCGAAA TCGCCAAAAC CTAGTAATGA CCCCGATGTC
GATGTTGACA ATCCAGCCCC AAGCATATCA CCCAAAGTCG GAGAATCTTG TAGCTTTGAT
GCGACAACGT TGGTTGCGAC CGATGAAGGG CTGGTTCCAA TTGCCACGAT TCAAGTTGGC
GATCTGGTCT TAGCCTACGA CGAAGCTACT GGCACAACTG ACTACTACAC CGTTACCGCC
AGCTTTGTGC ATAGTGATCC AGTCTTAGTT GACGTTGCCA TTGACAATGA ATGGATCAAC
ACAACTCCCG AGCATCCCTT CTTCACCGCC GATGGCTGGA CAGATGCTGA AAACCTTGAA
CTTAGCGACT GGGTTGCCAG CGCTGATGGC GAGTGGGGAA AAATCACCGC GTTGCGCCTA
CGTGGCGACC AACGCTTTAT GTACAACCTT ACGGTTGCCG AAGCTCATAC CTTCTTTGTT
GGTGATGGTC GCTGGTTAGT CCATAATGTT TGTATTGATG GTGTCGAATA TCCTGTCGAT
AAACCAACTA ACCAAACCAA AAATGACTCT ATTTGGTATG ATACTCAAGC GCAGGCACGC
CAAATGGCAC GTCAAAAAAT GGGACATAAT CCTGTTGAAA TTGAACCAAA TAAGCTACGG
CGCAGGGATG GTACATGGCA ATATCGTGCA AAACCAGGAG ATTTAGCAGG TGATAACCAA
GGCCCACACA TTCATTTGGA AAAACTAGAT CCCAAGACAG GCGATGTACT TATCAACCTA
CACTTACGTT GGAGAAAATA A
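
The gene sequence above is laid out in space-separated blocks of ten bases, so it has to be cleaned before any computation. The following sketch, with hypothetical helper names, strips the whitespace and computes a GC fraction of the kind summarised in the GC content field. Only the first row of the sequence is used here as sample input, so the printed percentage describes that row and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCTGCCTA TTCGTCGTCG CCGTTGGATA CCGCTCGCAC TCTTGCTGAT AGTAATTGTG"
seq = clean_sequence(first_row)

print(len(seq))   # 60
print(seq[:3])    # ATG (start codon)
print(round(100 * gc_fraction(seq)))  # GC percentage of this row only
```

Feeding the full 7461 bp sequence through the same two functions would be the way to check the record's GC content value.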
 
Protein sequence
MLPIRRRRWI PLALLLIVIV SLLPLQSTSA IPMLPQSDAL TKRVPEILDE PPVNNRPSPA 
QSAVTPKEEA QNIAPVAESQ HSFSGIDLTF TGDTISQVPV AATNAAVYVA QATVNDLYIV
NGQPHYNSDN LSYYEDTGIF MNLPRDSFTV NVEVAVGNIG FIYIQSVNNP QIYQKAAFTT
PQGNPHDMHF TIGPSIPRNQ PIHIYVGCFN PQPGGYNNCR YTNFQFYLDS DVLTRIPDPA
YDHPKKLLPS YINQVNTGKY LPADGAGNPF FMYGYLPPVQ GIDEKFLYQT KAFNFPALQP
NDTITINFRY SAFSWPGTTP ANSFGSHINL FFSDANSSTY DVFGWDTLSR DLTNPINIPI
PGFSTYSYIP WRSGSITLNQ NIYSQISNKT IRINLAPQSA DSQYYPYLAG IDSIQFYRNG
KLMNFSSIED RIPADQNGGS CAPCTAAGES TMIVGDPVNT LSGAYIEHGV DQQIPTGGTA
LSLQRTYVST LANSSLYPQS SLGLGWRFNY AESLTLPVEI TGVTTVGAES NTVIYEAADG
NRYRFKRHGN DFVAARGIRL TLTKNATSYT LQAADQSAKT FDLQGRLLQL RDVYGRTQTI
TLGTSGIQTG LPVEVRDDLS QRKLRLEYTT TSGVVRLYRL LNDLNQATVY EYTQGRLEWV
TSPQSILSRY TYDPTSLLMS SVTRNATSKS PLPTADLVMS YTNGRVTQQT APADNTVWAY
TYTGTGANQA QTTTMTVSRG GIVLDVQRSH YRADGTFAWQ ERNDALLEYV ANDRMLAPTA
GVKADKSIAL HHNNPKGQPV EVYAGAIEGA ILGTTQRTFA ISYTPTGYPA IVTNTNGLET
YTTYNSANQP LTTKAGTGTL VPTQTWTYDA ATKQPQTITS PDGIVTRYTY TAAGQVASTE
VGYGTSSVQK TTYQYDSLGR QTHMTLGDGL PNATTTVTEY RPDNQIAKIT RNYVAGNTTD
PRKNVVTEYG YNAKGQLIWT KTPDGRYRGV TSYDALGRVR WVADNTVNPS TGAIAIADTS
STSLPPSFSP QRPDANIVTM YGYDWHGRTT LITQTGIVTG TFNVATLQWQ SSTSRVTRIE
YDQLSRPVTT TYNYRPDIYA GQFNTNHPDV NVPTYTYYDG AGKVTWTRDG LRRWNHFEYD
QLGRPVTTTL NYENGDPLTV DPVNATWAST NDTDIISITH YDQGGQIDHT IRNFVDGVRD
TTIPDSSIPW QITDVVTDVV TRYHYDQAGR MDQTISNYVA GATDPEFNRV ANTIYDQATG
RTLGVTDAWN QYTVYEYDQL GRSVSSVTNC RTSSGVPTYN RNTCASTTSQ RNLPSPLSVY
DQLGRLSQTL SVDAIGGLGF NGPPTQYSYD GLGQLVETIV NSQPGQAASA TVNVKSTTTY
LDAAGSRWNE TDPTGKITQY ETDGFGQVNK VIDPAGLITL SGPRWTKTPD GQVRVAMIDG
LGRTIKTVSN YQDGVYVPAT DTTTHDIITL TRYDVGGRQV AVTNSLNIRT RYDYDLRDNL
IRIIENDLGT CTASDTNDCQ VTTEYRYDRV GNLIKTIDAR GYSTTKSYDS LGRVRRTTDG
LNRHTLLTYN QDDTVATIAP ATGNPISYSY DEVGHQIQAT GWDSTAVQQW TYDLAGRLSS
VYDASAQDAA AGRTGTISYK YDVLNRISSV AQSLEGDPQG NWTLTYGYDA AGRTTAIGGT
SYSYDTAGRL SSVIRGGSPL AQYAYQGSTG RVASITRLEA GNNRLIESLG YDTRGRVTSR
SITGATTAAA MPTPTAPQLA QFTYAYDRAD QPTSFTETQL NSAGTATTNE THSYTYDKLN
RLISENASGT TTTFRFDAAG NRIKVNSQNS SYNAANERVG FAYDVYGNVT GDGQNVFSYD
GFNRLKSVIK GANVYSYRYH EETLTSRFVN GTRTQAFNYD RVGTSLSSIL RIHTIQPTDT
LYTTYITGLG GDIIQSEATF NGAPQTNGKL FLISDNQSTV RMLVNASNGA RTKQDSDAWG
NFVPAAGQTL APSSIRYTGE YTDRDTGLVF LRARWYNPAS GTLLSKDPFA GFANQPQSQH
PYIYAGNNPT TNSDPTGRTC EGCQPAGVTR DQWHQYVDLY DVTYSGFVGA LINHPFDRMG
DATVFDNYAH ATRSSLDFIY YLKNFLWGCS IQQYGALENL WSNAETWSEH FYHDRLPSVP
LHRYDSIEDA MDALGDEAYL NYFIETNLAD GNHLDPIDSI SPGLADVPVV PSTPRQNNPS
TDVDDPAPAK SPKPSNDPDV DVDNPAPSIS PKVGESCSFD ATTLVATDEG LVPIATIQVG
DLVLAYDEAT GTTDYYTVTA SFVHSDPVLV DVAIDNEWIN TTPEHPFFTA DGWTDAENLE
LSDWVASADG EWGKITALRL RGDQRFMYNL TVAEAHTFFV GDGRWLVHNV CIDGVEYPVD
KPTNQTKNDS IWYDTQAQAR QMARQKMGHN PVEIEPNKLR RRDGTWQYRA KPGDLAGDNQ
GPHIHLEKLD PKTGDVLINL HLRWRK
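
The protein sequence above can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the standard NCBI amino-acid ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts, so the same mapping is assumed to apply here. The function name `translate` and the short input fragments are illustrative; the fragments are the first 30 and last 21 bases of the gene sequence with the block spacing removed.

```python
from itertools import product

# Standard NCBI codon ordering: bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


print(translate("ATGCTGCCTA TTCGTCGTCG CCGTTGGATA"))  # MLPIRRRRWI
print(translate("CACTTACGTT GGAGAAAATA A"))           # HLRWRK*
```

The two outputs agree with the first ten and last six residues of the protein sequence, with the trailing `*` marking the TAA stop codon that is not part of the 2486 aa protein.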