Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2224 |
Symbol | |
ID | 5734111 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2821969 |
End bp | 2829453 |
Gene Length | 7485 bp |
Protein Length | 2494 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279365 |
Product | YD repeat-containing protein |
Protein accession | YP_001544992 |
Protein GI | 159898745 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTAA ACCGCCCCTA TCGACGATTG ATTCACGTTG GAATGCTCTT GAGCATGCTC TGGTCGCTCT TCCCAGCGAG CCAGCCAGCC ACCGCCCAAA CTGATCAGTC CAACAAAACG GTTGAGCAAG CGTTTGCTCC CGATACTGGC CTAACACGCG ATCCGCAAAC CCCGCGTTTA GCAACTGCGC TACCAAAAAA AGCCAGCAGC AGGGCTGATT TTATGCCTGT GGCGGCCCAA AACTTAAGCA TTCAGCCGCA ACTTAATCAG GTGGTCACTG CCCAACTTGA TAGCGCCAAA ACCACCGCCG TACAATTCGA GCAAGCGCCG TTGAGCTTGT TGGTTGAGGC CGATACCTTC GCGACCGCTA CGCGCTTGGA GTTTCAAGCT CAAGCCTTGC CAAACTTGAC CCAACAGCTC CAACGCAGCA GCAAAAGTGA AGGCTACCTG CGCGACCAAG CTGAGCAAGT TACGTTCTAT CGCTTTAACA TCGAAGCCCA AACCAACGAT CAAGCTGCCA ACTTCAAAAA ACCAGTGCGC ATGGTGCTCG ATTTGCGCCA ACTCATGCGC GATGTTCCCA GCAATTATCA ACAATTTTAT TTGGCCTACC AAGATCAAGC CGATCCCAAC CACTGGATCG AAGTGCCAAT TACCTTACAC GATGCCAAGG GCTTGATTAG CGCCGATGTT GACCATTTTT CGACATGGGC AGCTGGCACA CGGCCAGAGC GTTGGAATCC CAGCTGGGTT CCAGCAGCAG TTGCCAGTTT TAGCGGAGCA GCCACCTATG CCTACCCAAT TGAAGGCCCA ATGGGTCGTG GTGGCCTGCA ACCAAAAGTC GAATTAAGCT ACAACAGCCG GAGCCTTGAT GGCCGGATTC GCGATGACAC TGGTGCAGGC CCACTGGGCG ATGCTTGGTC AATCAGCGAT ATTAGCATTG CACGGGTTGG GGTAAAAACT GAATTCGTTG CGGGCTTTCC ATCCAACAAA CATCCCGACA ACTTCCGTTT GACGATCAAC GGCGCTGGAC ATGAACTCTT CCCTGAGTTT CCCAGCCAAA CTGCCACTGC CAGCAGCATG CGCTATTTTG CCAAAGATGC TCCAGGCTAC TACATCAAAC GGGTCTATAA CACGGCCACC CCCAATACCG ATGGAATATA TTGGATTGTG GTAACGCCAA CGGGTGTGAC CTATCGTTTG GGTTATTATC CTCACGCCGA AGAACACCAA ATTTGGGATA TTGGCTATTG GAATGTGCAA GGCCATCAAG GCCGCCCCAA TAACGAACGC TCAGCCTTGG CTTGGCATGT CGATACGGTC ACCGATCGCG CTGGCAACCA GATGACCTAT CAATATGTCA ACTGGACAGT CACCGAGCCA ATTGAGTGGT ATCTCCAAGG CTCAAGCCAT AAATCGACCT TGCAATTAAC TACGTGGAAA AGCCGCATCG GCAGCATTAG CTACAATTAC CCAAATCGGG TGACGGCCTT GCCAGTCAGC GATACCGTTG CTCAATTGAG CACTACTCCT GCCAGCCGCT TGGTATTTAC CACCAAAACC CAATTTGCCT ACCTGATCGA CACAATTTAT GTCTATCATG GCTCGTTGAG CACCCCAATT AAAGAATATC GGATTAACTT ATCAGGCCAT TTCGTTGATA GTCCGGCCTG TATGAACCAA GATACTCAGC CGAATATTCC GCGCTCAACT CATACTCGGG TGGTCAATGC GATTACGGTC GCTAGCGGGG TTGATGCCGA CCCAACAACC GAAGATGGTT GGACTTTGCC CGCCACCAGC TTTACCTACG AAGCCAAACC ACATTACAAC AACAATTGTT TCTCTTTCTA CTATCTCAAG AGCATGCGCA GCAATTATGG TGGTGAAATC AGCTTCAATT ACGCCTCGGA TAATCGTTGG ATTGGCGATT ATACCTACCT AGGCTATGAT CGGTATGTTT GGCCAAGTTT GGGCCAAAGC TATTACGTGG TCGAAACTTT AGCCAACGAC GGGCGTAATC CTGCCGTCAA AACCACCTAC AGCTATAGCC AGCCCTGTTA TGGCCAATGG TCAAGCAACG TTCCCGCAGG CGCAATCACT TGTGGCGCAA GCGATGCTCC CGAGTTTGGC ACAATCACTG GCTTTGCCAC GGTCAATCAA CGCAGCTACG ATTTTAACGG CACAACGCTG CTCAAACGCA ACGAAACAAT TTTCTCGCAA AATAGTGCTA CCACCAGTGG CAAGCCCATG ATTCAGCGCA ATTTTGCTGG CGATGGAACG TTGTTGGATG CAACCCACAG CACCTACAAC ACCGATAGCC TGCATGGCTT GCCCAATATG TTCACCTATT TGAGCGAAGT TAAGAGCTAT CAATATAGCA ACGGCCTTGA AATTTCGACC AAGCGCACGT TTGGTTACGA TGTGGCCAAA CAAGCTGGCG TGCAATATGG CAACCTCACC GACACTTGGC TCTATGCTAG TGCTGCTGCA ACGACACCTT ACGAAAAACA AGTTACCTAT TACACGCCCA ACAATGGCCG AGCCACTGGA GGCCAAGCCT GGATGGTCAA CGCGCCCACA GCCAGCGGTC GTTACGATCA ACAAGGCACA TTTTTAAACG GCACATGGAT CTATTATGAT GGGGCCACAA CCAACGCTAC TGCCCCAACC CAAGGCTTGG TCACCCGCAG CCGCCAAACT CGGCCTATTA CTTGTGCTGA GATTCCAAAT CCAAGCGGCT TGCCACAACC AGCTGACCCT AATTGTGTCC ATGCCTTCCA AACCATCGAT AGCGATATGA CTTACGATAG CTTTGGCAAC CCCAAAACGA CCACAATCTA CAGTGGCTTT GGCTATCGTT CGCTCAAAGC CAACTTTAGC GATTCGCTCG ATTGGAAGCC CACCGAAACT GGCCAACCAA ACCTGAGCCA AGTTGCCAGT TTGTGGTACG ACAGCGATTA CAATCTATAT CCAGTCAAAA CCACTAACGC ATTAAATCAA GCTACAACCT ACGAAATTTA TGGTTTCCGC AACGATGCTG GCAACATCGC CGCCGTTGAT GGCTTCCAAA TTCAGACTGG CTTGCTCAAA AGTGTCACCA ATCCCGATGC CACGATTGTA CGCTACGAGT ACGATCCATT TGGTCGCTTA GTCAATACTT TCGATAGCTA TAGCTTCACG GGCTTTGGCG ATAGCACCAA GTGGAACGGC AATCCAGTTA TTCGCTACCG CTATTGGGAT AATTATTGGA ACGACAGCGC GGTTTTTGCC AATCCTGCTG CCAATCAACC ATTCCTGATC AGCGATGAAA AGCGGCCTGG CAGCTATGCC AACCCCAGCA GCACTGGCAA TTTCGCCTAC AACGACCAAA CCATGTACGA TGGTTTTGGC CGCGCTATCC AAAGCCGCCA TATTTGGGCC GATGTTGATG GCGAAGCCAA GCGCCAAGAA ATTTATAGCA CAACCGCCTA CAATGCACTC GGACAGACAA TCTGCCAAAC TGCGCCGTTC AACCTGCCGT TCTACATCGA TCGCGGTTTG GTTTGGCCAG CGTCACCCTT CGTCACCACG CCATGCAGCG ATAGCAGTAT GGCCAAAACC CTGACCAGCT ACGATAATTT TGGGCGGGTC AAGCAAACCA CTGCCGCCGA TGGCAGCCTT AACAAAGCCA ATTGGAGCTT AGTTAACAAC ATCACGGTTG CTGGCCAAAA TCTCTTCTGG CAGCATCAAC AAATTAATCC CAAGAACCAA CTAGAAATGC GCTTGATCAA CAATCAAGAA CAGTTGGTCT TGCGGCGCGA ATATCGCGGT ACTGCCGATA GCCCAATCGT TTATAGCGAT ACCCAGTTCC AATACGATAC CCTCGGCAAT ATCAACAAAA TCAGCCGCCG CCAACCTAGC AACGCTGGCA ACGGAGCGTT GATCGCACCC GAAGCAACCA TGGTTTACAA CGGCTTTGGC CATAAGTTGC AAATCAACGA CCCCGATATG GGCACAATCA AGTATCGCTA TAACGCCAAT GCGCGAATTA TTGAGCAACG CACGCTCAAC GATAGTCTGC TTACGAACGA TGACGATGTT GTGTGTTTCT ATTTCGATGC GTTGCAGCGC AATACCAGCA AAAATACCAC CAATGCTGGC GCTAATTGTT CTAACACGCC AATTTTGAAC GGGGCGTTGT GGCTAGCCAA CTCAAGCTAC TACAGCAGTG GCGCAGGCAA AATCGGCAAA CTCCAATCAG TTAAATGGAA TCGTGATGGT AATGGCGCGG TTGATGGCGA AAGTTTCAAC TACAATAGCC TTGGCTTGCT CACCAGCCAT ACGCGCACGC TCAATGGCGT AAGCTTTAGC ATGCAATTTG GCGACTTCGA TGCCCTCAAT CGCGCCACCA CAATCACCTA TCCCGATGGC GAAGTCGCAA CCATCACCCA CGATTTAGAA GGCGAAAATA GCCTGAGTTT GGGCAGCCAT GGTGCGTTGG TAAGCAATAT CGAATACAAC GCCCGTGGCC ACATCAGTCT AATCGATCGT ACAAATGGCG GGCATAACAC GGTCTTTAAT TATTATGGCG CAACTGGCAC GGCCAACACA GGCAACAGCA ATTTCCGCCT AGCTAGCATC AATCATCAAC ATAGCCTGTT GCCGAGCTAT ACCTACGAGT ACGATCAAAT CGGCAATATC AGCCTCCTCT ACGAAAGTGG CTCGCTCTCG GGCAATACCT ATTTCAATTA TGATGAATTG GATCGTTTGA CCAGCACCAG CGGCATTTAC AGCCATATCT ATGCTTATGA CAAGCTGGGT AACTTAACCA ACAATAATGG CATTGCTCAA ACCTACAACG GCATAGGTAC TCAGCCGCAT GCGCTGCGTT CAACCAGCCA AGGCAATTTC TTCGAGTACG ATCAAGCAGG CAATATGATC GTGCGCAACG ATGCTAGCGG CCTGTATCAA CAAGCCTTCG ATGTTGAGCA ACGCTTGTAT GAAGTGATCG ATCAACACGA TCAAACCACG CGCTTCCGCT ATGATCCCAG CGGCCAGCGC ACTACCACCT TCGCCGCTGA TGGCACAGTT ACCTACGATC CTTTCCCGAA TTATCAACGC ACGACCGTCA GCAGCAGCAA CTCAGCCGTG GATAGCCTCA ACGCTGGAGT TCTGTGTAGC GATTACAACC CTGAAACAAA AAGCTATGGC GGCGCTGGCT ATATTATGTA CAGTGAAATC CCAGTCAAAC AGCGCTTTGG CAATTTGCCA GCGGCCAACA TCAGCGACCA CTTCATCTGT GTGCGCAACA ATACTGGCGT TTGGGAATAC GATAACGATG CTGGCTTCTA TGCATTCACT CCGATTGCCA GCGACTTGTT GGTCGCCAGT TTCAACTACA ATGCAACCAC AGTCAGCCCA TATCTCAATC AATCAGGGGC GATTTATGGC CTGCGCTATG GCTATACCAC CAGCAACCTA GCCTTCAGCA AAGATGTCTT TGGTGGAACC AACAATCCAG GCGAGTTCGA AATTGCTGGA ACCCACTTCA AAACCAATGC CTTTGCCCAA AGCGTCGCCA ACCATGGCTA TGGTGTGGCT TGCCAAGAAG ATGCAACTGG CACAGGCTAC CTGATGTACA GCGCTGAATC GGTGCATAGC CGCTTCGCCG AGCAAGCGCC CGATATCAAC AATGCTGCGC ACTTCATCTG TGTACGCCAC AACGGCCAAA CCTGGCAATA CGATAATAAT TCGGCCTATT TCGCCTTTAC TCCACGCCAA AGCGACCGCT TGATCGGCGC AATCGATTTC AGCAACGATA GCTACACCAG CTATGTCGGC CAAACTGGCA CGATCTTGGG CATGCAAAAA GGCCTAAGCA GCAGCAACTT AACCATCACG GTCAATCAAT GGAATGGCGA GAGCAACCCA GGCGAATTTG GGATTGCTGG CTTGAATTTC ACGCCGCAAG CCTACGAAGT AACGATCACA TCGGCAGGCA TGGGCATCAA TTGTTTGGAT ACCGCGACAG GCACTGGCTA TATCATGCAC AGTCGCCAAG CCCTGAACCA ACGCTTTAGC CAATCGATCC CAGCGCAACT TGCCAGCAAA CACTTTGTTT GTGTGCGCTA TAACAGCACG CTCAGCACTT GGCAATACGA TGATGGCAGC AATTACTATG GCTTCACGCC ACGCGCAAGC GACACCTTGG TTGCCAGCGT CAACTTCAGC ACCGACCAAG TAACGAGCCT CGCGGGAGCA AGCGACAGCG AATTTGGCAT AACCAAAGGC TTTGTCAGCG GTATTAGCAT CGTCGCCAAT CAATGGGGCG GCAATAGCAA TGCTGGTGAG TTTCAAGTTA TTGGCAACAA TTTAACCACG CATACGATTG ATATCGGCAG CAAAACTGTG GCAATCGCTG GCACGCCAAT CGCTACCCGC CGCAAACATA GTGTCGCCAC AAGCTTGGTT GATCAATCAT TAGTGTTTGT GTATGTCGAT AAGCTCGGCA GCGCCAACAC TTTGATGGAT CAAACGGGCA CAGCGATTTT GAATAATGTA CGCTATCTGC CATTTGGCGA GGAACGCCTA GGCCTCAACT CAGCCTATAG CGATCGCGGC TTTACTGGCC ATCAAGAGAA TCGTGAGCTT GGCCTAACCT ATATGAATGC CCGCTTCTAT CTGCCAAGCA CTGGCCGCTT TATCAGTGCC GACAGCATGA TTCCTGAGCC GAGTAATCCC CAAAGTTTCA ATCGCTATAG CTATGTCTAC AACAACCCAA TCAACGCGAC TGATCCTTCG GGTCACTTGC CAGGTGATGA TGAACCTGAA ATTCCAAATC CGTTCCCTGA ATCGAATCCA ATCCCTAATA TGGAGTACAG CGCCTATAGA AAATGGCTTA ACTTCTGGCA AGCCTATACC GACAACGATA ATCCCTACAT CATGATTAAA CAAGGAGATC AACTTGTTGA AAGCTCAGTT GCAATCAAAG CGGGGTCTGT CAGTCTAAGC CCAGATAGCG TTAGTGTTGC TGGAAATTGG GGATTGGTAG GTGCTGAAGT ATCAATGCCA GGAGCATATA AGAAAGGTGA AGGCGATAAT TTCTTAGAAA CACTGTGGGA AGGAACATCA GCAAAAATAT TAATAGGTCC ACAGATCGAC TTAATTGTTG TCGAAGTGAT GCCTTTAGCA TTGGGGATCG ATCCATTTAC TGGTAATATT ACCCTCGAAA CAAGCGCAGG AATTGGACTA GTCGAGGGAA GTACTACAAT CAATCCGTTT GCCAAAACAG ATGCGATCTA TGTTTTGCAA ATCAGCGATG AATTGCATGA TAAGCTGATG GGTCGAGATC CATGTGCCTT GAGTGTTTCG TTCCAAGATC AACAAGCAGC TTGGACGGAA TTAATCAATG TAGTCAACAG CTATGGGTTT AATGGCACTG ACCCTCGCTG GACACTCCAT TCAATTCCAA ATTACGTCTA CAATGGCGAG GAAAACACAC CCTGA
|
Protein sequence | MSVNRPYRRL IHVGMLLSML WSLFPASQPA TAQTDQSNKT VEQAFAPDTG LTRDPQTPRL ATALPKKASS RADFMPVAAQ NLSIQPQLNQ VVTAQLDSAK TTAVQFEQAP LSLLVEADTF ATATRLEFQA QALPNLTQQL QRSSKSEGYL RDQAEQVTFY RFNIEAQTND QAANFKKPVR MVLDLRQLMR DVPSNYQQFY LAYQDQADPN HWIEVPITLH DAKGLISADV DHFSTWAAGT RPERWNPSWV PAAVASFSGA ATYAYPIEGP MGRGGLQPKV ELSYNSRSLD GRIRDDTGAG PLGDAWSISD ISIARVGVKT EFVAGFPSNK HPDNFRLTIN GAGHELFPEF PSQTATASSM RYFAKDAPGY YIKRVYNTAT PNTDGIYWIV VTPTGVTYRL GYYPHAEEHQ IWDIGYWNVQ GHQGRPNNER SALAWHVDTV TDRAGNQMTY QYVNWTVTEP IEWYLQGSSH KSTLQLTTWK SRIGSISYNY PNRVTALPVS DTVAQLSTTP ASRLVFTTKT QFAYLIDTIY VYHGSLSTPI KEYRINLSGH FVDSPACMNQ DTQPNIPRST HTRVVNAITV ASGVDADPTT EDGWTLPATS FTYEAKPHYN NNCFSFYYLK SMRSNYGGEI SFNYASDNRW IGDYTYLGYD RYVWPSLGQS YYVVETLAND GRNPAVKTTY SYSQPCYGQW SSNVPAGAIT CGASDAPEFG TITGFATVNQ RSYDFNGTTL LKRNETIFSQ NSATTSGKPM IQRNFAGDGT LLDATHSTYN TDSLHGLPNM FTYLSEVKSY QYSNGLEIST KRTFGYDVAK QAGVQYGNLT DTWLYASAAA TTPYEKQVTY YTPNNGRATG GQAWMVNAPT ASGRYDQQGT FLNGTWIYYD GATTNATAPT QGLVTRSRQT RPITCAEIPN PSGLPQPADP NCVHAFQTID SDMTYDSFGN PKTTTIYSGF GYRSLKANFS DSLDWKPTET GQPNLSQVAS LWYDSDYNLY PVKTTNALNQ ATTYEIYGFR NDAGNIAAVD GFQIQTGLLK SVTNPDATIV RYEYDPFGRL VNTFDSYSFT GFGDSTKWNG NPVIRYRYWD NYWNDSAVFA NPAANQPFLI SDEKRPGSYA NPSSTGNFAY NDQTMYDGFG RAIQSRHIWA DVDGEAKRQE IYSTTAYNAL GQTICQTAPF NLPFYIDRGL VWPASPFVTT PCSDSSMAKT LTSYDNFGRV KQTTAADGSL NKANWSLVNN ITVAGQNLFW QHQQINPKNQ LEMRLINNQE QLVLRREYRG TADSPIVYSD TQFQYDTLGN INKISRRQPS NAGNGALIAP EATMVYNGFG HKLQINDPDM GTIKYRYNAN ARIIEQRTLN DSLLTNDDDV VCFYFDALQR NTSKNTTNAG ANCSNTPILN GALWLANSSY YSSGAGKIGK LQSVKWNRDG NGAVDGESFN YNSLGLLTSH TRTLNGVSFS MQFGDFDALN RATTITYPDG EVATITHDLE GENSLSLGSH GALVSNIEYN ARGHISLIDR TNGGHNTVFN YYGATGTANT GNSNFRLASI NHQHSLLPSY TYEYDQIGNI SLLYESGSLS GNTYFNYDEL DRLTSTSGIY SHIYAYDKLG NLTNNNGIAQ TYNGIGTQPH ALRSTSQGNF FEYDQAGNMI VRNDASGLYQ QAFDVEQRLY EVIDQHDQTT RFRYDPSGQR TTTFAADGTV TYDPFPNYQR TTVSSSNSAV DSLNAGVLCS DYNPETKSYG GAGYIMYSEI PVKQRFGNLP AANISDHFIC VRNNTGVWEY DNDAGFYAFT PIASDLLVAS FNYNATTVSP YLNQSGAIYG LRYGYTTSNL AFSKDVFGGT NNPGEFEIAG THFKTNAFAQ SVANHGYGVA CQEDATGTGY LMYSAESVHS RFAEQAPDIN NAAHFICVRH NGQTWQYDNN SAYFAFTPRQ SDRLIGAIDF SNDSYTSYVG QTGTILGMQK GLSSSNLTIT VNQWNGESNP GEFGIAGLNF TPQAYEVTIT SAGMGINCLD TATGTGYIMH SRQALNQRFS QSIPAQLASK HFVCVRYNST LSTWQYDDGS NYYGFTPRAS DTLVASVNFS TDQVTSLAGA SDSEFGITKG FVSGISIVAN QWGGNSNAGE FQVIGNNLTT HTIDIGSKTV AIAGTPIATR RKHSVATSLV DQSLVFVYVD KLGSANTLMD QTGTAILNNV RYLPFGEERL GLNSAYSDRG FTGHQENREL GLTYMNARFY LPSTGRFISA DSMIPEPSNP QSFNRYSYVY NNPINATDPS GHLPGDDEPE IPNPFPESNP IPNMEYSAYR KWLNFWQAYT DNDNPYIMIK QGDQLVESSV AIKAGSVSLS PDSVSVAGNW GLVGAEVSMP GAYKKGEGDN FLETLWEGTS AKILIGPQID LIVVEVMPLA LGIDPFTGNI TLETSAGIGL VEGSTTINPF AKTDAIYVLQ ISDELHDKLM GRDPCALSVS FQDQQAAWTE LINVVNSYGF NGTDPRWTLH SIPNYVYNGE ENTP
|
| |