Gene Haur_1441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1441 
Symbol 
ID5733305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1667443 
End bp1673859 
Gene Length6417 bp 
Protein Length2138 aa 
Translation table11 
GC content50% 
IMG OID641278579 
Producthypothetical protein 
Protein accessionYP_001544213 
Protein GI159897966 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01451] conserved repeat domain
[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTCTGC CCAATCATCG TTGGAAGCGT TTAAGCATGC TCCTACTGGC ATGTAGCTTG 
GCGCAAATGC TGGTTCCTAC TACGAATGCC CAACAAACGG CGATTGCTCC GCAACGGCAG
GAGCTGCCAT TAAGTAAAGC CGTCGATTAT GCGCTACCCA AGCAGCAGGC GGTGCAACTG
GCAAACACCA AAGCATTGGC TGCTCCTGGG TTAACCTTCG AGCTTGATCT GCCCAACCAG
GTCAGCATCG GCCAAAATGT GCCGTTTACG CTCACGCTGA TCAACCGGAG TACCACCGAA
GGCCAAAATA TTGTTGTCTC GTTGCCAGTT CCGGCTGGAA CGCAAGCCTT AAAAAATCCC
AATCTGGTTG ATCAAACGCA CTGGCAATGG CAGATTAAAC GCTTAGCACC GCAAAGCCAA
CAGGCATTAA CCGGCAGCGT GCGCATTACC AGTCGCCCTG AACATGGCGC ATTATTGTTT
AATCCGCAAG CGAACGCCAA CAACTTGCCA CAGCCGGCCA CGCTCAGCGC TGGCGCATTG
ATTGCGCCAC AAACTTCAGC TAAACAAGCC GAGATTTGGC AAGCTCAGGG CCAAACCATC
CACAGCAACG ACCAACGGGT CAGCTTGGTA TTACCTGCTA GCGCCACTTC GGCCAACTTT
AGCTATTCGC CCTATCAAAG CACAGCCAAT CGTCAAAAAG CCCTCGGCTA TAGCGAATTT
GTGCCCGAAC GCCGTGGGTT TAGCAGCTTT GAAATTCAGG CTGCCCAAAA ATCGTTTGCG
CCTGTTGAAT TGCAATTTCA CTATAGCCCT GAAGAATTGC AAGTGCTTGG ATTGCAAGAG
CAAACCCTTC GTTTGTTTAC CTATGATCAA ACGAGCCAGA GTTGGCAAAG CATCGCCACC
ACGATTGATT CAATTAATCA TGTGGCCAAA GCGACGATTA CCAACGACGG AGCCTTCAAT
TTAAGCGATG GTTCTTCGCC ATCAACCGCC TATTTGCCAA CCCTGCAAGG CTTTCAAGGC
GGCGGCTTTA CTGGTGCGGC CAGCTATAGC GTGCCGATCG AGGTTCCAGC TGGAGCTGCT
GGGCATAAAC CAGCGGTTAA TCTGAGCTAT TCCAGCGCTG CATCCGATGG CAGCAACGGA
GCACGGCCAT TATGGCAGGC CAGCTCAGTT GGCAAAGGCT GGGATTTAGG CATCGGCGGA
TCGATTGGCC GCAACACCAG CGCAGGCTCC AGCGACCATC AATGGGATAG CTTTAGCCTG
GTTTTCGATG GTCAATCATC GGATATGGTA CGTGGCCAGC CGCTCGATGG CAATTACACC
AATCTTGCCG AATCGAACTG GACATGGCAT GCCACCGATG AAACGTTTGT TAAACTACGT
AAATCGGCCA GCAGCGACAC CTGGACGGCT TGGACCAAGA ATGGCACACG CTACGAATTC
AATCAAGCAC TGCGCTGGGG AACCAACACC CCATCCAATC GCTTTGAAAC CTACAAATGG
TTGCTGACCA AAGTGGTTGA TCCAAGTGGC AATGCGATTG TCTATCAATA TTATGTTGAC
ACAATTCAAG CTGCCGAGCC AATTCACCCA ACATGGTTTT TGCAAAACGT CTTTTGGGGC
TACGATGGCG CAACCCCTGG CACGGGCACA CCTCGCTATG CTCTCTCGTT CGACATGAGT
CCACGCTGGT CGGCTCCAAC TGAAAATGTC GATCTGAACT GGGAGTACAC CACTTCGCGG
GTGATGGGCA AGGCTGGCAC GCCACACGAA GCCTATCGCA TCGACAAAAT TAAAGTGCTG
AGCATGCCAC CTGGCCAAAC CAGCTATCAA TTAATGCGAG CCTATAAGCT CAACTACGCA
GCCTTTGCCA ATAGCCTCAC CACTGATGTT AGCAATGGGC AACGGGTATT GACCTTGGCC
AATGTTCAGC GGCTCGGCAA AAATAACGAA GCCTTGCCTG CAATCAGCTT TAACTATGGC
ATGAGCCAAG GCAATGCACT TGAGCCAACC CCAGGCTGGA ACCGCCTAAC CCAAGTTGAT
AACGGCCAAG GCGGTCTACT AACCATCAAT TACGAGCATG TTTGGAGTCA AGGAGTTACT
GATTATGCCA AGTATTACCA GAATTATTAT CGTGTGGCCA ACGTCGTTCA AGCTGACACC
AGTAATTTAG CCTATAGCCG CGCCTATTTG ACCACCTACA GCTATGCCTA TCCCGCGTTG
AACGATTATG GCCATTCTTT AGCAGTGGTC TATGCGCAAT ACCCAGAAAG TGGCAATGGC
GATAGTCGTT TTGCCTTGGC ACACGCCGAA AAAAGCGAAT TTCGCGGGCA TGCCCAAGTC
AGCGAGCGGG TGTATGATGG CAACACAACT GCCGCACCAC TGTTGCGCAC CACCGAAACA
TGGTTCCATC AAGGCAACGG AGGCAGTGCC GCCACTCCTT GTTCGCCAGC GATTATCACA
CCTGCGTCTG GCAACCAATA TGTGAATGTG AGCGATAGCT GCTACATTGC CATGCGCGAT
AGCGAAAGCT GGAAGGGCAA GGTTATTGCT CAAGAAACCT GGTTCAACGG ATCGCGTTTG
AGCCGCACCG CCAATAGCTA TACCCGCGTC GCCTTGCCGT TCTATGGCAA CGACGGCAAT
GCCACCCACG CCAGCCAAAG CAACAACTAC AAACGGGCTG GCTTATGGCG AGCCTTCAAC
TATACTGCCG CAATCACCGA AAGCACCTAT GAAGCAGGCA CAACCAATGC CCGCAGCCTC
ACTACCAACT TCACCTATGA GCCAACCTAT GGCAACTTAA CTGAAAAACG AGTTGCTGAT
GATACTGGCA CAGTTTTGCG CAAAGAACTT GCCTGGTATG CAACCCGCGA CGATGCCAAC
AGCTACATCG TTGATCGGCC ATGGCAAACT GCCACAACCG ATGGCGCTGG CCGTTATATG
GCACTTTCAG CCAATTTCTA CGATGGTGCA ACTAACACCA ATCTGCTTGG CACACGCGGT
TTACTCACCC GCCAAAGCCA CTACTTTAAT GTGCCACTGC AAACCGACTT AACTGGTACC
ACCTTATATG GCTCCGATGC GGTGTATGGC TACGATCAGT ATGGCAATCG CACAAGCCAG
GCCAGCTATA ATGCCGATTA CAGCACCCGC ATCAATACTA ATGGCGTGGT CAGCTATGGC
GTGCCAGGCA AGGGAACTGC CGCCCGTACC AGCACCATCG AATACGATAA CACCTACCAT
GCCTTCCCAA TTCGCGAGAC TAATCCGCTT AACCAGAGCC AACAAGCCGA ATACGACTAT
CAACTTGGCA CAGTCACGAA AGTATTTGAT CTGAATGGCA ATGCCACTTC AGCAACCTAC
GATAGCTTTG GCCGTATGAC CAATTTGATC AAACCAGGCG ATAGTAGCAG TTTCCCAACG
GCGCATATCG ATTATTATGA TAGCTATCGC CCAATTTTAT ACCTCGTGAG CCTGCGCGAA
GATGCTGGTC AGAGTTACTT TCGGCCAATT TTACACTTCT ACAATGGCTT TGGCCAAGAG
ATCCAAACCA AAGCTGAGAG CATCGACGGC AGTCAACACA TTGTCAACGA TACCAGCTTC
GATGGCTTGG GCCGAGCCAC CAGCCAATCC GAACCGCGCT ATGTGAGCGA TACAACGAAT
TTCTGGGGCT ATGTTCCAGC CAGCAATCCA CTCTATCAAG CAACAACGAC CAGCTTCGAT
GGCTTGAATC GGCCATTGGT CATCACTAGC CCTGGCAATA GAACGGTCGA GCATCATTAT
GGCGTAACCA CGAGCTTTTT GTATGATGAT GTGATCGATC AAAATCGTCA TCGCACGCAG
TATCGCTACG ATGTGTTTGG GCGGTTGCGC GAGGTCAACG AAATTAGCGG CAACTGCGCT
ACTGGCTATT GGGCAAACTA CGCGTGTGGC GGTTCATTCA CCACCAATTG GTCAGTCGCT
ACCAACACGC GCTACGGCTA CGATGCGCTT GATCGGTTGA TAACGGTGAT CGATGCCCAA
AATAACACCA CCAGCATGCG TTACGACTCA CTTGGTCGCA AAATGCGTAT CCAAGATCCT
GATATGGGCC AATATAACTA TAGCTACGAT GCTGCTAGCA ACTTAAATGG CCAAACCAAT
GCCAAAGGCC AAACGGTCAG CTTTAACTAC GATGCGCTCA ACCGTTTGAC CAGCCAAGTC
TTCCCTGATG GTAGCCATAA CGATTATTTC TATGATGTTG TTGGGCAACA AACCGCTGGC
TACAACTATG GCAAAGGCCA CCGTACCTCG ATGCAAAGCG TGCTGGCCAA TGGCACAATC
CAGACTTTCC AGCGCTGGGA GTATGATGCT CGTGGCCGCG AAATCTTCAG TGGTCACAAC
ACCGATCTCA CCAAAGCCCA CCATATCCTG ACCAGCTACG ATAGCGCTGA TCGGGTCAAA
ACCCGCCGTT ATCAGCCAAT CGACGAAACC GTAACTTACA ACTACGACGC TGCTTGGAAT
GAATATAGCC TGTGTACATC GCTCGGTGGT TGTTATGTGA CCGGAGCCAG CTATGATGCG
CTCAACCAGC CAACCCGCGT CAACTATGGC AACGGCAGCT ACAACCAATA TCGTTACGAC
GACTCAACGC GGCTGTTAGA AGCGCTCGAA ATCTTTTCAG CCCAAGGCAC CAATCTCTAT
GCCCGCAACT ATTGGTACGA TAACGTTGGT AATATCAATG GCATCGGCAC ATGGGACAAT
GGCAGCAATA ACCAAACCCG CCAAATTCAA TATTTCAACT ACGACGACCA AAATCGCCTA
ACTCGCGCTT GGACAACTGG CGATAGCGCT GGCGCATACG ACCAAAGCAT GAGCTATGAC
TCAATCGGCA ACTTATTGAG CAAAGCTGGT GCCGCTTATA CGTATCCGGC TGCTGGCAGC
GCTCGACCAC ATGCCGCCAC CACAATTGGC AGCAAAAACT ACAACTACGA TGCCAATGGC
AATCTCAGCA CCACCTACAC AGGCACAAGC ACTAATGGCA GCGGCAACCG CTATAGCTGG
GATTACGCCA ACCGCCTCGT GCAAGTTGAA TCGTATGTGC GTGCACCTAT CAGCGGTGGC
GGTAGTGATT GCGATGATGG CAACGAACGC ATCCCAGATG CCAGTAATCC TGAGGCAATT
CCACCAATTA CCTGCCCACA TCGCAGCGCC AGCCCCAACC AACCAACGGT TGATAGCTAT
ACCGTTCAAG AACAATATGT GTACGATGCT GATGGCAAAC GCAGCGCTCG CATCGCCAAC
GGCCAAACAA TCATCTACTT TGAAGGCGCA TGGGAAGATA CGCTTGGCGT AAACGCTCGC
AAACTCTATA CATTCAATGG CAGCATCGTG GCCCAACGTG ATAGCGATAA CACCATGAGC
TATCTGCATG GTGATCAGCT TGGCTCGGTC AGCATCGTGA CCAACGCCAG CGGTGGACTG
AAGCACAAAC AAGAATTCGA CCCATGGGGC AATATTCGCG AAGGCGGAGC CAGTAGCACC
AAACTCAACT ATACGGGACA ATATCGCGAT GATACGGGCT TGATCTTTAT GAATGCACGC
TACTACGACC CCAAGATTGG TCGCTTTATC AGCGCCGATA CGGTTGTGCC TGGCAGCCCA
AGCGGCTCGA TGAATGGCAG CGGTTTACGC TCATTGACCA CCGATTTCAC CGACCCAGCC
TTTACCATGA GCCTTAACCA AGAACAACGC AGCACACCAT GGTTTACCTT GAGTCAAGCA
CAACAACAGG ATATTGGCAC ACCGTGGGGG CCAGCCAACC CGCAAAGCCT CAATCGCTAT
AGCTATGTCC AGAACAACCC ACTAACTTAC ACCGACCCAA CTGGACATGC ACGAACGGTT
ATTAGAACAG CGATCGAAGC CAATCGCGCC TTAACTACTT TAAAAGCGTG GCTCAGAGAC
AGAAATATCG GATTTGAAGT AAAGGGTGCT GAGATAGCAG GTACTACCGA AACTATGGGG
ATTAATTTAC TTGTAGATAT TGATATGCTT GTCCTTCAAC TAGGAACTGA TATGGCTAAC
ATAATCAGAG GTTATATCGA TGCACTCAAT ACAATGGTTA CTAACCCAGA ATTATTCCCT
GATGGTATAT ACCTAGATTA TGATGTAACG AGAAATGGCA AATGGATTAC CACAACCATG
GGATGGGGAT TTTGTGATAA TGCTGACTGT AGTATGACTG GTGATCCTGT ATCACAGGCA
AGTAGTACAT ATAGACATTA CACCTTATTC GAACAATACC TTACAGCAAT TGATCCCACG
TTATTAAGCC AAGGCTTCTA TTTACACAAT CAACGCAATA TGTACATTGC AGACAATATG
AGTACAGTCT TCTTTGGTAT TAGAAAAGGA CAGCAAAGTA AATATAATAA AAAATAA
 
Protein sequence
MILPNHRWKR LSMLLLACSL AQMLVPTTNA QQTAIAPQRQ ELPLSKAVDY ALPKQQAVQL 
ANTKALAAPG LTFELDLPNQ VSIGQNVPFT LTLINRSTTE GQNIVVSLPV PAGTQALKNP
NLVDQTHWQW QIKRLAPQSQ QALTGSVRIT SRPEHGALLF NPQANANNLP QPATLSAGAL
IAPQTSAKQA EIWQAQGQTI HSNDQRVSLV LPASATSANF SYSPYQSTAN RQKALGYSEF
VPERRGFSSF EIQAAQKSFA PVELQFHYSP EELQVLGLQE QTLRLFTYDQ TSQSWQSIAT
TIDSINHVAK ATITNDGAFN LSDGSSPSTA YLPTLQGFQG GGFTGAASYS VPIEVPAGAA
GHKPAVNLSY SSAASDGSNG ARPLWQASSV GKGWDLGIGG SIGRNTSAGS SDHQWDSFSL
VFDGQSSDMV RGQPLDGNYT NLAESNWTWH ATDETFVKLR KSASSDTWTA WTKNGTRYEF
NQALRWGTNT PSNRFETYKW LLTKVVDPSG NAIVYQYYVD TIQAAEPIHP TWFLQNVFWG
YDGATPGTGT PRYALSFDMS PRWSAPTENV DLNWEYTTSR VMGKAGTPHE AYRIDKIKVL
SMPPGQTSYQ LMRAYKLNYA AFANSLTTDV SNGQRVLTLA NVQRLGKNNE ALPAISFNYG
MSQGNALEPT PGWNRLTQVD NGQGGLLTIN YEHVWSQGVT DYAKYYQNYY RVANVVQADT
SNLAYSRAYL TTYSYAYPAL NDYGHSLAVV YAQYPESGNG DSRFALAHAE KSEFRGHAQV
SERVYDGNTT AAPLLRTTET WFHQGNGGSA ATPCSPAIIT PASGNQYVNV SDSCYIAMRD
SESWKGKVIA QETWFNGSRL SRTANSYTRV ALPFYGNDGN ATHASQSNNY KRAGLWRAFN
YTAAITESTY EAGTTNARSL TTNFTYEPTY GNLTEKRVAD DTGTVLRKEL AWYATRDDAN
SYIVDRPWQT ATTDGAGRYM ALSANFYDGA TNTNLLGTRG LLTRQSHYFN VPLQTDLTGT
TLYGSDAVYG YDQYGNRTSQ ASYNADYSTR INTNGVVSYG VPGKGTAART STIEYDNTYH
AFPIRETNPL NQSQQAEYDY QLGTVTKVFD LNGNATSATY DSFGRMTNLI KPGDSSSFPT
AHIDYYDSYR PILYLVSLRE DAGQSYFRPI LHFYNGFGQE IQTKAESIDG SQHIVNDTSF
DGLGRATSQS EPRYVSDTTN FWGYVPASNP LYQATTTSFD GLNRPLVITS PGNRTVEHHY
GVTTSFLYDD VIDQNRHRTQ YRYDVFGRLR EVNEISGNCA TGYWANYACG GSFTTNWSVA
TNTRYGYDAL DRLITVIDAQ NNTTSMRYDS LGRKMRIQDP DMGQYNYSYD AASNLNGQTN
AKGQTVSFNY DALNRLTSQV FPDGSHNDYF YDVVGQQTAG YNYGKGHRTS MQSVLANGTI
QTFQRWEYDA RGREIFSGHN TDLTKAHHIL TSYDSADRVK TRRYQPIDET VTYNYDAAWN
EYSLCTSLGG CYVTGASYDA LNQPTRVNYG NGSYNQYRYD DSTRLLEALE IFSAQGTNLY
ARNYWYDNVG NINGIGTWDN GSNNQTRQIQ YFNYDDQNRL TRAWTTGDSA GAYDQSMSYD
SIGNLLSKAG AAYTYPAAGS ARPHAATTIG SKNYNYDANG NLSTTYTGTS TNGSGNRYSW
DYANRLVQVE SYVRAPISGG GSDCDDGNER IPDASNPEAI PPITCPHRSA SPNQPTVDSY
TVQEQYVYDA DGKRSARIAN GQTIIYFEGA WEDTLGVNAR KLYTFNGSIV AQRDSDNTMS
YLHGDQLGSV SIVTNASGGL KHKQEFDPWG NIREGGASST KLNYTGQYRD DTGLIFMNAR
YYDPKIGRFI SADTVVPGSP SGSMNGSGLR SLTTDFTDPA FTMSLNQEQR STPWFTLSQA
QQQDIGTPWG PANPQSLNRY SYVQNNPLTY TDPTGHARTV IRTAIEANRA LTTLKAWLRD
RNIGFEVKGA EIAGTTETMG INLLVDIDML VLQLGTDMAN IIRGYIDALN TMVTNPELFP
DGIYLDYDVT RNGKWITTTM GWGFCDNADC SMTGDPVSQA SSTYRHYTLF EQYLTAIDPT
LLSQGFYLHN QRNMYIADNM STVFFGIRKG QQSKYNKK