Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1441 |
Symbol | |
ID | 5733305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1667443 |
End bp | 1673859 |
Gene Length | 6417 bp |
Protein Length | 2138 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278579 |
Product | hypothetical protein |
Protein accession | YP_001544213 |
Protein GI | 159897966 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3209] Rhs family protein |
TIGRFAM ID | [TIGR01451] conserved repeat domain [TIGR01643] YD repeat (two copies) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCTGC CCAATCATCG TTGGAAGCGT TTAAGCATGC TCCTACTGGC ATGTAGCTTG GCGCAAATGC TGGTTCCTAC TACGAATGCC CAACAAACGG CGATTGCTCC GCAACGGCAG GAGCTGCCAT TAAGTAAAGC CGTCGATTAT GCGCTACCCA AGCAGCAGGC GGTGCAACTG GCAAACACCA AAGCATTGGC TGCTCCTGGG TTAACCTTCG AGCTTGATCT GCCCAACCAG GTCAGCATCG GCCAAAATGT GCCGTTTACG CTCACGCTGA TCAACCGGAG TACCACCGAA GGCCAAAATA TTGTTGTCTC GTTGCCAGTT CCGGCTGGAA CGCAAGCCTT AAAAAATCCC AATCTGGTTG ATCAAACGCA CTGGCAATGG CAGATTAAAC GCTTAGCACC GCAAAGCCAA CAGGCATTAA CCGGCAGCGT GCGCATTACC AGTCGCCCTG AACATGGCGC ATTATTGTTT AATCCGCAAG CGAACGCCAA CAACTTGCCA CAGCCGGCCA CGCTCAGCGC TGGCGCATTG ATTGCGCCAC AAACTTCAGC TAAACAAGCC GAGATTTGGC AAGCTCAGGG CCAAACCATC CACAGCAACG ACCAACGGGT CAGCTTGGTA TTACCTGCTA GCGCCACTTC GGCCAACTTT AGCTATTCGC CCTATCAAAG CACAGCCAAT CGTCAAAAAG CCCTCGGCTA TAGCGAATTT GTGCCCGAAC GCCGTGGGTT TAGCAGCTTT GAAATTCAGG CTGCCCAAAA ATCGTTTGCG CCTGTTGAAT TGCAATTTCA CTATAGCCCT GAAGAATTGC AAGTGCTTGG ATTGCAAGAG CAAACCCTTC GTTTGTTTAC CTATGATCAA ACGAGCCAGA GTTGGCAAAG CATCGCCACC ACGATTGATT CAATTAATCA TGTGGCCAAA GCGACGATTA CCAACGACGG AGCCTTCAAT TTAAGCGATG GTTCTTCGCC ATCAACCGCC TATTTGCCAA CCCTGCAAGG CTTTCAAGGC GGCGGCTTTA CTGGTGCGGC CAGCTATAGC GTGCCGATCG AGGTTCCAGC TGGAGCTGCT GGGCATAAAC CAGCGGTTAA TCTGAGCTAT TCCAGCGCTG CATCCGATGG CAGCAACGGA GCACGGCCAT TATGGCAGGC CAGCTCAGTT GGCAAAGGCT GGGATTTAGG CATCGGCGGA TCGATTGGCC GCAACACCAG CGCAGGCTCC AGCGACCATC AATGGGATAG CTTTAGCCTG GTTTTCGATG GTCAATCATC GGATATGGTA CGTGGCCAGC CGCTCGATGG CAATTACACC AATCTTGCCG AATCGAACTG GACATGGCAT GCCACCGATG AAACGTTTGT TAAACTACGT AAATCGGCCA GCAGCGACAC CTGGACGGCT TGGACCAAGA ATGGCACACG CTACGAATTC AATCAAGCAC TGCGCTGGGG AACCAACACC CCATCCAATC GCTTTGAAAC CTACAAATGG TTGCTGACCA AAGTGGTTGA TCCAAGTGGC AATGCGATTG TCTATCAATA TTATGTTGAC ACAATTCAAG CTGCCGAGCC AATTCACCCA ACATGGTTTT TGCAAAACGT CTTTTGGGGC TACGATGGCG CAACCCCTGG CACGGGCACA CCTCGCTATG CTCTCTCGTT CGACATGAGT CCACGCTGGT CGGCTCCAAC TGAAAATGTC GATCTGAACT GGGAGTACAC CACTTCGCGG GTGATGGGCA AGGCTGGCAC GCCACACGAA GCCTATCGCA TCGACAAAAT TAAAGTGCTG AGCATGCCAC CTGGCCAAAC CAGCTATCAA TTAATGCGAG CCTATAAGCT CAACTACGCA GCCTTTGCCA ATAGCCTCAC CACTGATGTT AGCAATGGGC AACGGGTATT GACCTTGGCC AATGTTCAGC GGCTCGGCAA AAATAACGAA GCCTTGCCTG CAATCAGCTT TAACTATGGC ATGAGCCAAG GCAATGCACT TGAGCCAACC CCAGGCTGGA ACCGCCTAAC CCAAGTTGAT AACGGCCAAG GCGGTCTACT AACCATCAAT TACGAGCATG TTTGGAGTCA AGGAGTTACT GATTATGCCA AGTATTACCA GAATTATTAT CGTGTGGCCA ACGTCGTTCA AGCTGACACC AGTAATTTAG CCTATAGCCG CGCCTATTTG ACCACCTACA GCTATGCCTA TCCCGCGTTG AACGATTATG GCCATTCTTT AGCAGTGGTC TATGCGCAAT ACCCAGAAAG TGGCAATGGC GATAGTCGTT TTGCCTTGGC ACACGCCGAA AAAAGCGAAT TTCGCGGGCA TGCCCAAGTC AGCGAGCGGG TGTATGATGG CAACACAACT GCCGCACCAC TGTTGCGCAC CACCGAAACA TGGTTCCATC AAGGCAACGG AGGCAGTGCC GCCACTCCTT GTTCGCCAGC GATTATCACA CCTGCGTCTG GCAACCAATA TGTGAATGTG AGCGATAGCT GCTACATTGC CATGCGCGAT AGCGAAAGCT GGAAGGGCAA GGTTATTGCT CAAGAAACCT GGTTCAACGG ATCGCGTTTG AGCCGCACCG CCAATAGCTA TACCCGCGTC GCCTTGCCGT TCTATGGCAA CGACGGCAAT GCCACCCACG CCAGCCAAAG CAACAACTAC AAACGGGCTG GCTTATGGCG AGCCTTCAAC TATACTGCCG CAATCACCGA AAGCACCTAT GAAGCAGGCA CAACCAATGC CCGCAGCCTC ACTACCAACT TCACCTATGA GCCAACCTAT GGCAACTTAA CTGAAAAACG AGTTGCTGAT GATACTGGCA CAGTTTTGCG CAAAGAACTT GCCTGGTATG CAACCCGCGA CGATGCCAAC AGCTACATCG TTGATCGGCC ATGGCAAACT GCCACAACCG ATGGCGCTGG CCGTTATATG GCACTTTCAG CCAATTTCTA CGATGGTGCA ACTAACACCA ATCTGCTTGG CACACGCGGT TTACTCACCC GCCAAAGCCA CTACTTTAAT GTGCCACTGC AAACCGACTT AACTGGTACC ACCTTATATG GCTCCGATGC GGTGTATGGC TACGATCAGT ATGGCAATCG CACAAGCCAG GCCAGCTATA ATGCCGATTA CAGCACCCGC ATCAATACTA ATGGCGTGGT CAGCTATGGC GTGCCAGGCA AGGGAACTGC CGCCCGTACC AGCACCATCG AATACGATAA CACCTACCAT GCCTTCCCAA TTCGCGAGAC TAATCCGCTT AACCAGAGCC AACAAGCCGA ATACGACTAT CAACTTGGCA CAGTCACGAA AGTATTTGAT CTGAATGGCA ATGCCACTTC AGCAACCTAC GATAGCTTTG GCCGTATGAC CAATTTGATC AAACCAGGCG ATAGTAGCAG TTTCCCAACG GCGCATATCG ATTATTATGA TAGCTATCGC CCAATTTTAT ACCTCGTGAG CCTGCGCGAA GATGCTGGTC AGAGTTACTT TCGGCCAATT TTACACTTCT ACAATGGCTT TGGCCAAGAG ATCCAAACCA AAGCTGAGAG CATCGACGGC AGTCAACACA TTGTCAACGA TACCAGCTTC GATGGCTTGG GCCGAGCCAC CAGCCAATCC GAACCGCGCT ATGTGAGCGA TACAACGAAT TTCTGGGGCT ATGTTCCAGC CAGCAATCCA CTCTATCAAG CAACAACGAC CAGCTTCGAT GGCTTGAATC GGCCATTGGT CATCACTAGC CCTGGCAATA GAACGGTCGA GCATCATTAT GGCGTAACCA CGAGCTTTTT GTATGATGAT GTGATCGATC AAAATCGTCA TCGCACGCAG TATCGCTACG ATGTGTTTGG GCGGTTGCGC GAGGTCAACG AAATTAGCGG CAACTGCGCT ACTGGCTATT GGGCAAACTA CGCGTGTGGC GGTTCATTCA CCACCAATTG GTCAGTCGCT ACCAACACGC GCTACGGCTA CGATGCGCTT GATCGGTTGA TAACGGTGAT CGATGCCCAA AATAACACCA CCAGCATGCG TTACGACTCA CTTGGTCGCA AAATGCGTAT CCAAGATCCT GATATGGGCC AATATAACTA TAGCTACGAT GCTGCTAGCA ACTTAAATGG CCAAACCAAT GCCAAAGGCC AAACGGTCAG CTTTAACTAC GATGCGCTCA ACCGTTTGAC CAGCCAAGTC TTCCCTGATG GTAGCCATAA CGATTATTTC TATGATGTTG TTGGGCAACA AACCGCTGGC TACAACTATG GCAAAGGCCA CCGTACCTCG ATGCAAAGCG TGCTGGCCAA TGGCACAATC CAGACTTTCC AGCGCTGGGA GTATGATGCT CGTGGCCGCG AAATCTTCAG TGGTCACAAC ACCGATCTCA CCAAAGCCCA CCATATCCTG ACCAGCTACG ATAGCGCTGA TCGGGTCAAA ACCCGCCGTT ATCAGCCAAT CGACGAAACC GTAACTTACA ACTACGACGC TGCTTGGAAT GAATATAGCC TGTGTACATC GCTCGGTGGT TGTTATGTGA CCGGAGCCAG CTATGATGCG CTCAACCAGC CAACCCGCGT CAACTATGGC AACGGCAGCT ACAACCAATA TCGTTACGAC GACTCAACGC GGCTGTTAGA AGCGCTCGAA ATCTTTTCAG CCCAAGGCAC CAATCTCTAT GCCCGCAACT ATTGGTACGA TAACGTTGGT AATATCAATG GCATCGGCAC ATGGGACAAT GGCAGCAATA ACCAAACCCG CCAAATTCAA TATTTCAACT ACGACGACCA AAATCGCCTA ACTCGCGCTT GGACAACTGG CGATAGCGCT GGCGCATACG ACCAAAGCAT GAGCTATGAC TCAATCGGCA ACTTATTGAG CAAAGCTGGT GCCGCTTATA CGTATCCGGC TGCTGGCAGC GCTCGACCAC ATGCCGCCAC CACAATTGGC AGCAAAAACT ACAACTACGA TGCCAATGGC AATCTCAGCA CCACCTACAC AGGCACAAGC ACTAATGGCA GCGGCAACCG CTATAGCTGG GATTACGCCA ACCGCCTCGT GCAAGTTGAA TCGTATGTGC GTGCACCTAT CAGCGGTGGC GGTAGTGATT GCGATGATGG CAACGAACGC ATCCCAGATG CCAGTAATCC TGAGGCAATT CCACCAATTA CCTGCCCACA TCGCAGCGCC AGCCCCAACC AACCAACGGT TGATAGCTAT ACCGTTCAAG AACAATATGT GTACGATGCT GATGGCAAAC GCAGCGCTCG CATCGCCAAC GGCCAAACAA TCATCTACTT TGAAGGCGCA TGGGAAGATA CGCTTGGCGT AAACGCTCGC AAACTCTATA CATTCAATGG CAGCATCGTG GCCCAACGTG ATAGCGATAA CACCATGAGC TATCTGCATG GTGATCAGCT TGGCTCGGTC AGCATCGTGA CCAACGCCAG CGGTGGACTG AAGCACAAAC AAGAATTCGA CCCATGGGGC AATATTCGCG AAGGCGGAGC CAGTAGCACC AAACTCAACT ATACGGGACA ATATCGCGAT GATACGGGCT TGATCTTTAT GAATGCACGC TACTACGACC CCAAGATTGG TCGCTTTATC AGCGCCGATA CGGTTGTGCC TGGCAGCCCA AGCGGCTCGA TGAATGGCAG CGGTTTACGC TCATTGACCA CCGATTTCAC CGACCCAGCC TTTACCATGA GCCTTAACCA AGAACAACGC AGCACACCAT GGTTTACCTT GAGTCAAGCA CAACAACAGG ATATTGGCAC ACCGTGGGGG CCAGCCAACC CGCAAAGCCT CAATCGCTAT AGCTATGTCC AGAACAACCC ACTAACTTAC ACCGACCCAA CTGGACATGC ACGAACGGTT ATTAGAACAG CGATCGAAGC CAATCGCGCC TTAACTACTT TAAAAGCGTG GCTCAGAGAC AGAAATATCG GATTTGAAGT AAAGGGTGCT GAGATAGCAG GTACTACCGA AACTATGGGG ATTAATTTAC TTGTAGATAT TGATATGCTT GTCCTTCAAC TAGGAACTGA TATGGCTAAC ATAATCAGAG GTTATATCGA TGCACTCAAT ACAATGGTTA CTAACCCAGA ATTATTCCCT GATGGTATAT ACCTAGATTA TGATGTAACG AGAAATGGCA AATGGATTAC CACAACCATG GGATGGGGAT TTTGTGATAA TGCTGACTGT AGTATGACTG GTGATCCTGT ATCACAGGCA AGTAGTACAT ATAGACATTA CACCTTATTC GAACAATACC TTACAGCAAT TGATCCCACG TTATTAAGCC AAGGCTTCTA TTTACACAAT CAACGCAATA TGTACATTGC AGACAATATG AGTACAGTCT TCTTTGGTAT TAGAAAAGGA CAGCAAAGTA AATATAATAA AAAATAA
|
Protein sequence | MILPNHRWKR LSMLLLACSL AQMLVPTTNA QQTAIAPQRQ ELPLSKAVDY ALPKQQAVQL ANTKALAAPG LTFELDLPNQ VSIGQNVPFT LTLINRSTTE GQNIVVSLPV PAGTQALKNP NLVDQTHWQW QIKRLAPQSQ QALTGSVRIT SRPEHGALLF NPQANANNLP QPATLSAGAL IAPQTSAKQA EIWQAQGQTI HSNDQRVSLV LPASATSANF SYSPYQSTAN RQKALGYSEF VPERRGFSSF EIQAAQKSFA PVELQFHYSP EELQVLGLQE QTLRLFTYDQ TSQSWQSIAT TIDSINHVAK ATITNDGAFN LSDGSSPSTA YLPTLQGFQG GGFTGAASYS VPIEVPAGAA GHKPAVNLSY SSAASDGSNG ARPLWQASSV GKGWDLGIGG SIGRNTSAGS SDHQWDSFSL VFDGQSSDMV RGQPLDGNYT NLAESNWTWH ATDETFVKLR KSASSDTWTA WTKNGTRYEF NQALRWGTNT PSNRFETYKW LLTKVVDPSG NAIVYQYYVD TIQAAEPIHP TWFLQNVFWG YDGATPGTGT PRYALSFDMS PRWSAPTENV DLNWEYTTSR VMGKAGTPHE AYRIDKIKVL SMPPGQTSYQ LMRAYKLNYA AFANSLTTDV SNGQRVLTLA NVQRLGKNNE ALPAISFNYG MSQGNALEPT PGWNRLTQVD NGQGGLLTIN YEHVWSQGVT DYAKYYQNYY RVANVVQADT SNLAYSRAYL TTYSYAYPAL NDYGHSLAVV YAQYPESGNG DSRFALAHAE KSEFRGHAQV SERVYDGNTT AAPLLRTTET WFHQGNGGSA ATPCSPAIIT PASGNQYVNV SDSCYIAMRD SESWKGKVIA QETWFNGSRL SRTANSYTRV ALPFYGNDGN ATHASQSNNY KRAGLWRAFN YTAAITESTY EAGTTNARSL TTNFTYEPTY GNLTEKRVAD DTGTVLRKEL AWYATRDDAN SYIVDRPWQT ATTDGAGRYM ALSANFYDGA TNTNLLGTRG LLTRQSHYFN VPLQTDLTGT TLYGSDAVYG YDQYGNRTSQ ASYNADYSTR INTNGVVSYG VPGKGTAART STIEYDNTYH AFPIRETNPL NQSQQAEYDY QLGTVTKVFD LNGNATSATY DSFGRMTNLI KPGDSSSFPT AHIDYYDSYR PILYLVSLRE DAGQSYFRPI LHFYNGFGQE IQTKAESIDG SQHIVNDTSF DGLGRATSQS EPRYVSDTTN FWGYVPASNP LYQATTTSFD GLNRPLVITS PGNRTVEHHY GVTTSFLYDD VIDQNRHRTQ YRYDVFGRLR EVNEISGNCA TGYWANYACG GSFTTNWSVA TNTRYGYDAL DRLITVIDAQ NNTTSMRYDS LGRKMRIQDP DMGQYNYSYD AASNLNGQTN AKGQTVSFNY DALNRLTSQV FPDGSHNDYF YDVVGQQTAG YNYGKGHRTS MQSVLANGTI QTFQRWEYDA RGREIFSGHN TDLTKAHHIL TSYDSADRVK TRRYQPIDET VTYNYDAAWN EYSLCTSLGG CYVTGASYDA LNQPTRVNYG NGSYNQYRYD DSTRLLEALE IFSAQGTNLY ARNYWYDNVG NINGIGTWDN GSNNQTRQIQ YFNYDDQNRL TRAWTTGDSA GAYDQSMSYD SIGNLLSKAG AAYTYPAAGS ARPHAATTIG SKNYNYDANG NLSTTYTGTS TNGSGNRYSW DYANRLVQVE SYVRAPISGG GSDCDDGNER IPDASNPEAI PPITCPHRSA SPNQPTVDSY TVQEQYVYDA DGKRSARIAN GQTIIYFEGA WEDTLGVNAR KLYTFNGSIV AQRDSDNTMS YLHGDQLGSV SIVTNASGGL KHKQEFDPWG NIREGGASST KLNYTGQYRD DTGLIFMNAR YYDPKIGRFI SADTVVPGSP SGSMNGSGLR SLTTDFTDPA FTMSLNQEQR STPWFTLSQA QQQDIGTPWG PANPQSLNRY SYVQNNPLTY TDPTGHARTV IRTAIEANRA LTTLKAWLRD RNIGFEVKGA EIAGTTETMG INLLVDIDML VLQLGTDMAN IIRGYIDALN TMVTNPELFP DGIYLDYDVT RNGKWITTTM GWGFCDNADC SMTGDPVSQA SSTYRHYTLF EQYLTAIDPT LLSQGFYLHN QRNMYIADNM STVFFGIRKG QQSKYNKK
|
| |