Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5268 |
Symbol | |
ID | 5737226 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | + |
Start bp | 44985 |
End bp | 50861 |
Gene Length | 5877 bp |
Protein Length | 1958 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641282432 |
Product | TPR repeat-containing protein |
Protein accession | YP_001548023 |
Protein GI | 159901778 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.494598 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGCA CGTTTGCATC CTTCCATCTC ACCATTGCCG CCCCCCACGG GGATCGCTAT CCCGTGACCG CCCGCACTCA GGCGGGCAAT GAAGTCAATG AGGACATCCT ACTGCCGCTT GATGATCCGA CCCTGACCGT CTACCAGATG GCGCTGCGTT ATCCCATATC TATTGACGAA TCCGTGGTGA TTGCGGTTGG CCAACTCCTC TACCAAGCCC TGTTTCAAGG AACCATTGCG GAAGCCTTTG CCACGGCTCG CGCCCACGCC GATCAGCAAA AAGTGGCGTT GCGTCTCCAT TTGGCGATTC AGAAATCCCT CCCCACAATT GCGGCACTTC CATGGGAACT GATGGCGACT GAGGCGGGCC GCCCGCTTAT GCTGGAACAT GCCTTAGTGC GAACTTTTTC GTGGAACGAT CCGATTCCTG ATTTGGGGAT TCCCTCGGGC GAGCGTATTC GGCTCGCCGT GACCTCGGCC TTACCTATGG AGTTGGCTGA TCACCCGATT GCGGCGGAAG ATGAAGTTGC CATCATCCGT GCTGCCATCA CGCATAGCGC ACGGCCAATT GATCTCATCG AAGTCCCTCA CCTCACCCGC GACCGCTTGA CCGATCTGCT CACGAACCAA CGCCCCCATA TCGTGCATCA TATCGGCCAT GGCATTATTC AATCAGATAT GAGCTATCTC GACCTTGAAC GCGCCGACCA GTCCCGTGAT CAGCTCTCGG CTCGCGAGTT CAGCAGTATG CTGCATCAGT CAGGGGTACA GCTGGTGGTC TTGAATGCCT GCCACACGGG CAGCGCTGGC GAGAACCTCC TCACCAGCTT TGCCCCAATT TTCATCACCG ATCGCATTCC GGCAGCGATT GGCATGCAAG CCGCGATCCT GAATCGCACT GGCCACTGCT TTGCGAATGC CTTCTATGCC GCCCTGGGCA ACAGTGGATC GATTGATGCC AGCCTGATTG CAGCACGCAA GGCAATTTTG GCCGATGGGC ATGAGCATGG GGCATGGGGG TTTCCAACCC TGTATAGTCG GGTTCCGCAC AGCCAACTGT GGGGTTATCG AACGCCACAG CCTGTTGATC CACTCACCCA ACGCGACCGC GAACGCATCA CCAAAGCCGC TCAACAAATC GAGAGTGATC GGCTTTTAAA TCTGCGCCTT CAGGGCTTCG TCGGTCGGGT CAACGAACTG GCGGCCATGC GCGAGCATAT CGAAGCGATG CGGCCTACGG GGGGCTATGT CCTGATCAAA GCGGCAGCAG GCGAAGGCAA GAGTAGCAGC ATTGCCAAGC TGATTCAGGA GGCGGGGATT GCGCAGACCC CGCACCACTT TATTGCCCTC ACCACGGGCC GCGACTATCA ATTGGGGTTG TTGCGTGCGG TGGTGGCGCA ACTGATTCTC AAACACAACC TGACGGTTTC CTATTTCCCC GAAGAAAGCT ATCCGGCCAT GAAGGGCGAG TTTGCGCGGA TTCTGGATGA TCTTTCCAAA CAGGGGATTC AGGAAACAAT CTATCTGGAT GGCCTCGATC AACTCCAACC AGAGCGTGAT GGCTCTCGGG ATCTCTCCTT TCTGCCACCG CAGCCGCCGC CAGGCATCGT GATCGTCCTT GGCTCACGAC CGGATGAAAC CTTGAAACCG CTGGAGATTT TGCATCGGGT TGATTATGCC TTGCCACCAC TCAGCGAAGG GGATGCTCTC GCGTTGTGGC GATCGGTCCA GCCTGCCGTG GCGGATAGCC TCTTGCATGA CCTCTATACA GCACTGAAGG GCAATGCGCT GTTTGTCTAT TTGGCTGCCG ATACGATGCG CGATCAATCC GTGGTCGATG CGACCAGTTT GATTCGCCGG ATTGAACAGA ACCCGCACAA CCTTTTCGGC ATCACGCTGG AGCGGATCAA AAATCGTTCT GAGCATCGTT GGCCGACCAT TTGGAAGCCC ATGCTGGCGC TGTTGCTGGT CGCCCAAGAA CCATTGCGGC TGGATGTCTT GGGCGATCTC CTCGAACATG ACCACGATAC CATGCAGGAT GCCGTGTGGG TCTTGGGCGG TTTGGTCAGT CAGGGCATTG ATCAACGGGT TGCGCTCCAT CATCTACTCT TTCGTGACTT TTTGGCGGCA TCGGTGTTTA ATGATCGTGA GGTCAAACGC TGGCACCAAC GACTGGCTGA CTGGTGTGCG CAGGATGTGG ATGCGATTTG GGCCGATCAT CCCGATACAA TGGAACAAGC ACGGCGGATC TATGCGCGGT ATCACTACAT CATCCATCTT GCACTGGCTG AGAACTGGAC AACACTCTGG CAGGTCTTGG ATACGGGCGA CTATGGCGAA CAGAAAACCC GCTTTGATCC GAGTACCCGC TTGTATGCGC TCGATTTGGA TCGCGGACGT GAAAGTGCTA TCACGGCGGG GCGATCAACC GAGGAACATA TTCAGAACTT GCCACGCTTG TGGAAATATA GTTTGTTGCG GACGAGCTTA ACCAGCCGTG TTGATCAATG GCCGGATGAG GTGTTTGAGG TTTTGGCAAT AGTTGGCCGC ACGCACGAAG CATTGGAACG CATTGAACTT CTTTCGAGCA ACGAGAATCA GATATCATGT TGGCTCAGGA TTCTTCCATG GTGTGATAAC AAACAACAGC ATTTACTGGT CATGCGATTG GACGAGGTAT GCAGACATCT TTCGGGCTTC GACAAAAAAA TGGATGCGAT TCGTGCTATT GCGAACGCTG CTGCCACCCA CGGCCAGCGA GAACAGGCAT TAGCGATTCT CACCACGGCT GTGTCCATCG CCCACTCCAT TGATCACCCA TACTATAGGA GCGATGCGCT CGCGACCATT GCAAAAGACA CCGCCACCTA CGGCCAGCGA GAACAGGCAT TAGCGATTCT CACCACGGCT GCGTCCATCG CTCACTCCAT TGATGACCCA TCCGATAAGA GCGCTGCGCT CGCGACCATT GCAATTGCTG CTGCCACCCT CGGCCAGCGA GAACAGGCAT TAGCGATTCT CACCACGGCT GTATCCATCG CCCACTCCAT TGATGACCCA TACGAGAAGA GCGATGCGCT CGCGACCATT GCAATTGCTG CTGCCACCCT CGGCCAGCGA GAACAGGCAT TAGCGATTCT TGCCACGGCT GTGTCCATCG CCCACTCCAT TGATCACCCA TACTATAGGA GCGATGCGCT CGCGACCATT GCAAAAGACA CCGCCACCTA CGGCGCTATC GACCAAGCCC TGTCCATCGC CCACTCCATT GATAACCCAT ACTATAGGAG CGATGCGCTC GCGACCATTG CAAATGCTGC TGCCACCCAC GGCCAGCGAG AACAGGCATT AGCGATTCTC ACCACGGCTG TATCCATCGC CCAATCCATT GATGGCTTCA ATAAGAGCGC TGTGCTCGCG ACCTTTGCAA ATGCTGCTGC CACCTACGGC GCTATCGACC AAGCCCTGTC CATCGCCCAC TCCATTGATG GCTTAAATAG GAGCGATGCG CTCGCGACCG TTGCAAAAGG CATCGCCACC TACGGCGCTA TCGACCAAGC CCTGTCCATC GCCCACTCCA TTGATCACCC ATACTATAGG AGCGATGCGC TCGCGACCAT TGCAAATGCT GCTGCCACCC ACGGCCAGCG AGAACAGGCA TTAGCGATTC TCACCACGGC TGTATCCATC GCCCAATCCA TTGATGGCTT CAATAAGAGC GCTGTGCTCG CGACCTTTGC AAAAGACACC GCCACCTACG GCGCTATCGA CCAAGCCCTG TCCATCGCCC ACTCCATTGA TCACCCATTG CAACGACGCG ATGTTCTCCT TATTATCGCA GCCACTGCTG CCACCCACGG CCAGCAAGAA CAGGCATTAG CGATTCTTGC CACAGCTGTG TCCATCGCCC ACTCCATTGA TGACCCATCC CATAAGAGCG CTGTGCTCGA GACCATTGCA AAAGACACCG CCACCTACGG CGCTATCGAC CACGCTGTAT CCATCACCCA ATCCATTGAT AGCCCATACC ATAAGAGCAA GGCGCTCGCG TCCATTGCAA ATGCTGCTGC CACCCTCGGC CAGCGAGAAC AGGCATTAGC GATTCTCACC ACGGCTGTAT CCATCGCCCA CTCCATTGAT CACTCATACC ATAAGAGCGC TGTGCTCGCG ACCTTTGCAA ATGCTGCTGC CACCTACGGC GCTATCGACC AAGCCCTGTC CATCGCCCAC TCCATTGATG GCTTAATAAA TAGGAGCGAT GCGCTCGCGA CCGTTGCAAA AGGCACCGCC ACCTACGGCG CTATCGACCA AGCCCTGTCC ATCGCCCACT CCATTGATCA CCCATTGCAA CGGCGCGATG TTCTCCTTAT TATCGCAGCC ACTGCTGCCA CCCACGGCCA GCGAGAACAG GCATTAGCGA TTCTTGCCAC GGCTGTGTCC ATCGCCCACT CCATTGATAT CCCATGGCAG AAGAGCGATG CGCTCGCGAC CATTGCAAAA GACACCGCCA CCTACGGCGC TATCGACCAC GCTGTGTCCA TCACCCACTC CATTGATGAC CTATTCTATA AGAGCGATGC TCTCTTTATT ATCGCAGCCA CTGCTGCCAC CCTCGGACAG CAAGAACAGG CATTAGCGAT TCTCGCCACA GCTGTGTCCA TCGCCCAATC CATTGATAGC CTATACTATC TGAGCGATGC GCTCGCGACC ATTGTAAATG CTGCTGCCAC CCACGGCCAG CGAGAACAGG CCTTGGCCAT CGCCCATTCC ATTGATGACC CATCCCATAA GAGCGCTGTG CTCGCGACCA TTGCAATTGC TGCTGCCACC CACGGCCAGC GAGAACAGGC CTTGGCCATC GCCCATTCCA TTGATGACCC ATACCATAAG AGCGATGCGC TCACGACCAT TGCAAATGCT GCTGTCACCC ACGGCCAGCG AGAACAGGCA TTAGCGATTC TCACCACGGC TGCGTCCATC GCCCACTCCA TTGATGACCC ATACTATAAG AGCGGTGCGC TCGCGACCAT TGCCAACGCT GCTGCCACCC TCGGCCAGCG AGAACAGGCA TTAGCGATTC TCGCCACGGC TGCGTCCATC ACCCAATCCA TTGATGACCC ATACGAGAAG AGCGATGCGC TCGCGACCAT TGCCAACGCT GCTGCCACCC TCGGCCAGCG AGAACAGGCA TTAGCGATTC TCGCCACGGC TGTGTCCATC ACCCACTCCA TTGATAGCCC ATCCCATAAG AGCACTGTGC TCGCGACCAT TGTAAATGCT GCCACCCACA GCGCTATCGA CCACGCTGTA TCCATCACCC AATCCATTGA TAGCCCATAC CATAAGAGCA AGGCGCTCGC GTCCATTGCA AATGCTGCTG CCACCCTCGG CCAGCGAGAA CAGGCATTAG CGATTCTCAC CACGGCTGTA TCCATCACCC ACTCCATTGA TAGCCCATCC CATAAGAGCA CTGTGCTCGC GACCATTGCA AATGCTGCTG CCACCTACGG CGCTATCGAC CAAGCCCTGT CCATCGCCCA CTCCATTGAT GACCCATTGC AACGGCACGC TGTACTCGCG ACCATCGCAG CCGCTGATGC ATCCCAAGAC GCTATCGAAC GTGCGCTGTC CATTGCCCAC TCCATTGACA ATCTAGACCA CCGTGCCGAG ACCTTTCGCA TCATTCTTCA AAAAGACCTA TCAGTCATAG ACGTTTTAAC AAGCATTCAG CATGAATGGT TCCGCAGTAA AATACCTCAG GATCTATGGA CAATGACACC AATGATTGCT CCATTATTAA ACGACTATCC ATGGTTAGGA ACAGTAATTC TGGAAGAAGA GGCATGGGTG AACGAACAAC TCAAACGACT GGGGTAA
|
Protein sequence | MSSTFASFHL TIAAPHGDRY PVTARTQAGN EVNEDILLPL DDPTLTVYQM ALRYPISIDE SVVIAVGQLL YQALFQGTIA EAFATARAHA DQQKVALRLH LAIQKSLPTI AALPWELMAT EAGRPLMLEH ALVRTFSWND PIPDLGIPSG ERIRLAVTSA LPMELADHPI AAEDEVAIIR AAITHSARPI DLIEVPHLTR DRLTDLLTNQ RPHIVHHIGH GIIQSDMSYL DLERADQSRD QLSAREFSSM LHQSGVQLVV LNACHTGSAG ENLLTSFAPI FITDRIPAAI GMQAAILNRT GHCFANAFYA ALGNSGSIDA SLIAARKAIL ADGHEHGAWG FPTLYSRVPH SQLWGYRTPQ PVDPLTQRDR ERITKAAQQI ESDRLLNLRL QGFVGRVNEL AAMREHIEAM RPTGGYVLIK AAAGEGKSSS IAKLIQEAGI AQTPHHFIAL TTGRDYQLGL LRAVVAQLIL KHNLTVSYFP EESYPAMKGE FARILDDLSK QGIQETIYLD GLDQLQPERD GSRDLSFLPP QPPPGIVIVL GSRPDETLKP LEILHRVDYA LPPLSEGDAL ALWRSVQPAV ADSLLHDLYT ALKGNALFVY LAADTMRDQS VVDATSLIRR IEQNPHNLFG ITLERIKNRS EHRWPTIWKP MLALLLVAQE PLRLDVLGDL LEHDHDTMQD AVWVLGGLVS QGIDQRVALH HLLFRDFLAA SVFNDREVKR WHQRLADWCA QDVDAIWADH PDTMEQARRI YARYHYIIHL ALAENWTTLW QVLDTGDYGE QKTRFDPSTR LYALDLDRGR ESAITAGRST EEHIQNLPRL WKYSLLRTSL TSRVDQWPDE VFEVLAIVGR THEALERIEL LSSNENQISC WLRILPWCDN KQQHLLVMRL DEVCRHLSGF DKKMDAIRAI ANAAATHGQR EQALAILTTA VSIAHSIDHP YYRSDALATI AKDTATYGQR EQALAILTTA ASIAHSIDDP SDKSAALATI AIAAATLGQR EQALAILTTA VSIAHSIDDP YEKSDALATI AIAAATLGQR EQALAILATA VSIAHSIDHP YYRSDALATI AKDTATYGAI DQALSIAHSI DNPYYRSDAL ATIANAAATH GQREQALAIL TTAVSIAQSI DGFNKSAVLA TFANAAATYG AIDQALSIAH SIDGLNRSDA LATVAKGIAT YGAIDQALSI AHSIDHPYYR SDALATIANA AATHGQREQA LAILTTAVSI AQSIDGFNKS AVLATFAKDT ATYGAIDQAL SIAHSIDHPL QRRDVLLIIA ATAATHGQQE QALAILATAV SIAHSIDDPS HKSAVLETIA KDTATYGAID HAVSITQSID SPYHKSKALA SIANAAATLG QREQALAILT TAVSIAHSID HSYHKSAVLA TFANAAATYG AIDQALSIAH SIDGLINRSD ALATVAKGTA TYGAIDQALS IAHSIDHPLQ RRDVLLIIAA TAATHGQREQ ALAILATAVS IAHSIDIPWQ KSDALATIAK DTATYGAIDH AVSITHSIDD LFYKSDALFI IAATAATLGQ QEQALAILAT AVSIAQSIDS LYYLSDALAT IVNAAATHGQ REQALAIAHS IDDPSHKSAV LATIAIAAAT HGQREQALAI AHSIDDPYHK SDALTTIANA AVTHGQREQA LAILTTAASI AHSIDDPYYK SGALATIANA AATLGQREQA LAILATAASI TQSIDDPYEK SDALATIANA AATLGQREQA LAILATAVSI THSIDSPSHK STVLATIVNA ATHSAIDHAV SITQSIDSPY HKSKALASIA NAAATLGQRE QALAILTTAV SITHSIDSPS HKSTVLATIA NAAATYGAID QALSIAHSID DPLQRHAVLA TIAAADASQD AIERALSIAH SIDNLDHRAE TFRIILQKDL SVIDVLTSIQ HEWFRSKIPQ DLWTMTPMIA PLLNDYPWLG TVILEEEAWV NEQLKRLG
|
| |