Gene Haur_5268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5268 
Symbol 
ID5737226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp44985 
End bp50861 
Gene Length5877 bp 
Protein Length1958 aa 
Translation table11 
GC content56% 
IMG OID641282432 
ProductTPR repeat-containing protein 
Protein accessionYP_001548023 
Protein GI159901778 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.494598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGCA CGTTTGCATC CTTCCATCTC ACCATTGCCG CCCCCCACGG GGATCGCTAT 
CCCGTGACCG CCCGCACTCA GGCGGGCAAT GAAGTCAATG AGGACATCCT ACTGCCGCTT
GATGATCCGA CCCTGACCGT CTACCAGATG GCGCTGCGTT ATCCCATATC TATTGACGAA
TCCGTGGTGA TTGCGGTTGG CCAACTCCTC TACCAAGCCC TGTTTCAAGG AACCATTGCG
GAAGCCTTTG CCACGGCTCG CGCCCACGCC GATCAGCAAA AAGTGGCGTT GCGTCTCCAT
TTGGCGATTC AGAAATCCCT CCCCACAATT GCGGCACTTC CATGGGAACT GATGGCGACT
GAGGCGGGCC GCCCGCTTAT GCTGGAACAT GCCTTAGTGC GAACTTTTTC GTGGAACGAT
CCGATTCCTG ATTTGGGGAT TCCCTCGGGC GAGCGTATTC GGCTCGCCGT GACCTCGGCC
TTACCTATGG AGTTGGCTGA TCACCCGATT GCGGCGGAAG ATGAAGTTGC CATCATCCGT
GCTGCCATCA CGCATAGCGC ACGGCCAATT GATCTCATCG AAGTCCCTCA CCTCACCCGC
GACCGCTTGA CCGATCTGCT CACGAACCAA CGCCCCCATA TCGTGCATCA TATCGGCCAT
GGCATTATTC AATCAGATAT GAGCTATCTC GACCTTGAAC GCGCCGACCA GTCCCGTGAT
CAGCTCTCGG CTCGCGAGTT CAGCAGTATG CTGCATCAGT CAGGGGTACA GCTGGTGGTC
TTGAATGCCT GCCACACGGG CAGCGCTGGC GAGAACCTCC TCACCAGCTT TGCCCCAATT
TTCATCACCG ATCGCATTCC GGCAGCGATT GGCATGCAAG CCGCGATCCT GAATCGCACT
GGCCACTGCT TTGCGAATGC CTTCTATGCC GCCCTGGGCA ACAGTGGATC GATTGATGCC
AGCCTGATTG CAGCACGCAA GGCAATTTTG GCCGATGGGC ATGAGCATGG GGCATGGGGG
TTTCCAACCC TGTATAGTCG GGTTCCGCAC AGCCAACTGT GGGGTTATCG AACGCCACAG
CCTGTTGATC CACTCACCCA ACGCGACCGC GAACGCATCA CCAAAGCCGC TCAACAAATC
GAGAGTGATC GGCTTTTAAA TCTGCGCCTT CAGGGCTTCG TCGGTCGGGT CAACGAACTG
GCGGCCATGC GCGAGCATAT CGAAGCGATG CGGCCTACGG GGGGCTATGT CCTGATCAAA
GCGGCAGCAG GCGAAGGCAA GAGTAGCAGC ATTGCCAAGC TGATTCAGGA GGCGGGGATT
GCGCAGACCC CGCACCACTT TATTGCCCTC ACCACGGGCC GCGACTATCA ATTGGGGTTG
TTGCGTGCGG TGGTGGCGCA ACTGATTCTC AAACACAACC TGACGGTTTC CTATTTCCCC
GAAGAAAGCT ATCCGGCCAT GAAGGGCGAG TTTGCGCGGA TTCTGGATGA TCTTTCCAAA
CAGGGGATTC AGGAAACAAT CTATCTGGAT GGCCTCGATC AACTCCAACC AGAGCGTGAT
GGCTCTCGGG ATCTCTCCTT TCTGCCACCG CAGCCGCCGC CAGGCATCGT GATCGTCCTT
GGCTCACGAC CGGATGAAAC CTTGAAACCG CTGGAGATTT TGCATCGGGT TGATTATGCC
TTGCCACCAC TCAGCGAAGG GGATGCTCTC GCGTTGTGGC GATCGGTCCA GCCTGCCGTG
GCGGATAGCC TCTTGCATGA CCTCTATACA GCACTGAAGG GCAATGCGCT GTTTGTCTAT
TTGGCTGCCG ATACGATGCG CGATCAATCC GTGGTCGATG CGACCAGTTT GATTCGCCGG
ATTGAACAGA ACCCGCACAA CCTTTTCGGC ATCACGCTGG AGCGGATCAA AAATCGTTCT
GAGCATCGTT GGCCGACCAT TTGGAAGCCC ATGCTGGCGC TGTTGCTGGT CGCCCAAGAA
CCATTGCGGC TGGATGTCTT GGGCGATCTC CTCGAACATG ACCACGATAC CATGCAGGAT
GCCGTGTGGG TCTTGGGCGG TTTGGTCAGT CAGGGCATTG ATCAACGGGT TGCGCTCCAT
CATCTACTCT TTCGTGACTT TTTGGCGGCA TCGGTGTTTA ATGATCGTGA GGTCAAACGC
TGGCACCAAC GACTGGCTGA CTGGTGTGCG CAGGATGTGG ATGCGATTTG GGCCGATCAT
CCCGATACAA TGGAACAAGC ACGGCGGATC TATGCGCGGT ATCACTACAT CATCCATCTT
GCACTGGCTG AGAACTGGAC AACACTCTGG CAGGTCTTGG ATACGGGCGA CTATGGCGAA
CAGAAAACCC GCTTTGATCC GAGTACCCGC TTGTATGCGC TCGATTTGGA TCGCGGACGT
GAAAGTGCTA TCACGGCGGG GCGATCAACC GAGGAACATA TTCAGAACTT GCCACGCTTG
TGGAAATATA GTTTGTTGCG GACGAGCTTA ACCAGCCGTG TTGATCAATG GCCGGATGAG
GTGTTTGAGG TTTTGGCAAT AGTTGGCCGC ACGCACGAAG CATTGGAACG CATTGAACTT
CTTTCGAGCA ACGAGAATCA GATATCATGT TGGCTCAGGA TTCTTCCATG GTGTGATAAC
AAACAACAGC ATTTACTGGT CATGCGATTG GACGAGGTAT GCAGACATCT TTCGGGCTTC
GACAAAAAAA TGGATGCGAT TCGTGCTATT GCGAACGCTG CTGCCACCCA CGGCCAGCGA
GAACAGGCAT TAGCGATTCT CACCACGGCT GTGTCCATCG CCCACTCCAT TGATCACCCA
TACTATAGGA GCGATGCGCT CGCGACCATT GCAAAAGACA CCGCCACCTA CGGCCAGCGA
GAACAGGCAT TAGCGATTCT CACCACGGCT GCGTCCATCG CTCACTCCAT TGATGACCCA
TCCGATAAGA GCGCTGCGCT CGCGACCATT GCAATTGCTG CTGCCACCCT CGGCCAGCGA
GAACAGGCAT TAGCGATTCT CACCACGGCT GTATCCATCG CCCACTCCAT TGATGACCCA
TACGAGAAGA GCGATGCGCT CGCGACCATT GCAATTGCTG CTGCCACCCT CGGCCAGCGA
GAACAGGCAT TAGCGATTCT TGCCACGGCT GTGTCCATCG CCCACTCCAT TGATCACCCA
TACTATAGGA GCGATGCGCT CGCGACCATT GCAAAAGACA CCGCCACCTA CGGCGCTATC
GACCAAGCCC TGTCCATCGC CCACTCCATT GATAACCCAT ACTATAGGAG CGATGCGCTC
GCGACCATTG CAAATGCTGC TGCCACCCAC GGCCAGCGAG AACAGGCATT AGCGATTCTC
ACCACGGCTG TATCCATCGC CCAATCCATT GATGGCTTCA ATAAGAGCGC TGTGCTCGCG
ACCTTTGCAA ATGCTGCTGC CACCTACGGC GCTATCGACC AAGCCCTGTC CATCGCCCAC
TCCATTGATG GCTTAAATAG GAGCGATGCG CTCGCGACCG TTGCAAAAGG CATCGCCACC
TACGGCGCTA TCGACCAAGC CCTGTCCATC GCCCACTCCA TTGATCACCC ATACTATAGG
AGCGATGCGC TCGCGACCAT TGCAAATGCT GCTGCCACCC ACGGCCAGCG AGAACAGGCA
TTAGCGATTC TCACCACGGC TGTATCCATC GCCCAATCCA TTGATGGCTT CAATAAGAGC
GCTGTGCTCG CGACCTTTGC AAAAGACACC GCCACCTACG GCGCTATCGA CCAAGCCCTG
TCCATCGCCC ACTCCATTGA TCACCCATTG CAACGACGCG ATGTTCTCCT TATTATCGCA
GCCACTGCTG CCACCCACGG CCAGCAAGAA CAGGCATTAG CGATTCTTGC CACAGCTGTG
TCCATCGCCC ACTCCATTGA TGACCCATCC CATAAGAGCG CTGTGCTCGA GACCATTGCA
AAAGACACCG CCACCTACGG CGCTATCGAC CACGCTGTAT CCATCACCCA ATCCATTGAT
AGCCCATACC ATAAGAGCAA GGCGCTCGCG TCCATTGCAA ATGCTGCTGC CACCCTCGGC
CAGCGAGAAC AGGCATTAGC GATTCTCACC ACGGCTGTAT CCATCGCCCA CTCCATTGAT
CACTCATACC ATAAGAGCGC TGTGCTCGCG ACCTTTGCAA ATGCTGCTGC CACCTACGGC
GCTATCGACC AAGCCCTGTC CATCGCCCAC TCCATTGATG GCTTAATAAA TAGGAGCGAT
GCGCTCGCGA CCGTTGCAAA AGGCACCGCC ACCTACGGCG CTATCGACCA AGCCCTGTCC
ATCGCCCACT CCATTGATCA CCCATTGCAA CGGCGCGATG TTCTCCTTAT TATCGCAGCC
ACTGCTGCCA CCCACGGCCA GCGAGAACAG GCATTAGCGA TTCTTGCCAC GGCTGTGTCC
ATCGCCCACT CCATTGATAT CCCATGGCAG AAGAGCGATG CGCTCGCGAC CATTGCAAAA
GACACCGCCA CCTACGGCGC TATCGACCAC GCTGTGTCCA TCACCCACTC CATTGATGAC
CTATTCTATA AGAGCGATGC TCTCTTTATT ATCGCAGCCA CTGCTGCCAC CCTCGGACAG
CAAGAACAGG CATTAGCGAT TCTCGCCACA GCTGTGTCCA TCGCCCAATC CATTGATAGC
CTATACTATC TGAGCGATGC GCTCGCGACC ATTGTAAATG CTGCTGCCAC CCACGGCCAG
CGAGAACAGG CCTTGGCCAT CGCCCATTCC ATTGATGACC CATCCCATAA GAGCGCTGTG
CTCGCGACCA TTGCAATTGC TGCTGCCACC CACGGCCAGC GAGAACAGGC CTTGGCCATC
GCCCATTCCA TTGATGACCC ATACCATAAG AGCGATGCGC TCACGACCAT TGCAAATGCT
GCTGTCACCC ACGGCCAGCG AGAACAGGCA TTAGCGATTC TCACCACGGC TGCGTCCATC
GCCCACTCCA TTGATGACCC ATACTATAAG AGCGGTGCGC TCGCGACCAT TGCCAACGCT
GCTGCCACCC TCGGCCAGCG AGAACAGGCA TTAGCGATTC TCGCCACGGC TGCGTCCATC
ACCCAATCCA TTGATGACCC ATACGAGAAG AGCGATGCGC TCGCGACCAT TGCCAACGCT
GCTGCCACCC TCGGCCAGCG AGAACAGGCA TTAGCGATTC TCGCCACGGC TGTGTCCATC
ACCCACTCCA TTGATAGCCC ATCCCATAAG AGCACTGTGC TCGCGACCAT TGTAAATGCT
GCCACCCACA GCGCTATCGA CCACGCTGTA TCCATCACCC AATCCATTGA TAGCCCATAC
CATAAGAGCA AGGCGCTCGC GTCCATTGCA AATGCTGCTG CCACCCTCGG CCAGCGAGAA
CAGGCATTAG CGATTCTCAC CACGGCTGTA TCCATCACCC ACTCCATTGA TAGCCCATCC
CATAAGAGCA CTGTGCTCGC GACCATTGCA AATGCTGCTG CCACCTACGG CGCTATCGAC
CAAGCCCTGT CCATCGCCCA CTCCATTGAT GACCCATTGC AACGGCACGC TGTACTCGCG
ACCATCGCAG CCGCTGATGC ATCCCAAGAC GCTATCGAAC GTGCGCTGTC CATTGCCCAC
TCCATTGACA ATCTAGACCA CCGTGCCGAG ACCTTTCGCA TCATTCTTCA AAAAGACCTA
TCAGTCATAG ACGTTTTAAC AAGCATTCAG CATGAATGGT TCCGCAGTAA AATACCTCAG
GATCTATGGA CAATGACACC AATGATTGCT CCATTATTAA ACGACTATCC ATGGTTAGGA
ACAGTAATTC TGGAAGAAGA GGCATGGGTG AACGAACAAC TCAAACGACT GGGGTAA
 
Protein sequence
MSSTFASFHL TIAAPHGDRY PVTARTQAGN EVNEDILLPL DDPTLTVYQM ALRYPISIDE 
SVVIAVGQLL YQALFQGTIA EAFATARAHA DQQKVALRLH LAIQKSLPTI AALPWELMAT
EAGRPLMLEH ALVRTFSWND PIPDLGIPSG ERIRLAVTSA LPMELADHPI AAEDEVAIIR
AAITHSARPI DLIEVPHLTR DRLTDLLTNQ RPHIVHHIGH GIIQSDMSYL DLERADQSRD
QLSAREFSSM LHQSGVQLVV LNACHTGSAG ENLLTSFAPI FITDRIPAAI GMQAAILNRT
GHCFANAFYA ALGNSGSIDA SLIAARKAIL ADGHEHGAWG FPTLYSRVPH SQLWGYRTPQ
PVDPLTQRDR ERITKAAQQI ESDRLLNLRL QGFVGRVNEL AAMREHIEAM RPTGGYVLIK
AAAGEGKSSS IAKLIQEAGI AQTPHHFIAL TTGRDYQLGL LRAVVAQLIL KHNLTVSYFP
EESYPAMKGE FARILDDLSK QGIQETIYLD GLDQLQPERD GSRDLSFLPP QPPPGIVIVL
GSRPDETLKP LEILHRVDYA LPPLSEGDAL ALWRSVQPAV ADSLLHDLYT ALKGNALFVY
LAADTMRDQS VVDATSLIRR IEQNPHNLFG ITLERIKNRS EHRWPTIWKP MLALLLVAQE
PLRLDVLGDL LEHDHDTMQD AVWVLGGLVS QGIDQRVALH HLLFRDFLAA SVFNDREVKR
WHQRLADWCA QDVDAIWADH PDTMEQARRI YARYHYIIHL ALAENWTTLW QVLDTGDYGE
QKTRFDPSTR LYALDLDRGR ESAITAGRST EEHIQNLPRL WKYSLLRTSL TSRVDQWPDE
VFEVLAIVGR THEALERIEL LSSNENQISC WLRILPWCDN KQQHLLVMRL DEVCRHLSGF
DKKMDAIRAI ANAAATHGQR EQALAILTTA VSIAHSIDHP YYRSDALATI AKDTATYGQR
EQALAILTTA ASIAHSIDDP SDKSAALATI AIAAATLGQR EQALAILTTA VSIAHSIDDP
YEKSDALATI AIAAATLGQR EQALAILATA VSIAHSIDHP YYRSDALATI AKDTATYGAI
DQALSIAHSI DNPYYRSDAL ATIANAAATH GQREQALAIL TTAVSIAQSI DGFNKSAVLA
TFANAAATYG AIDQALSIAH SIDGLNRSDA LATVAKGIAT YGAIDQALSI AHSIDHPYYR
SDALATIANA AATHGQREQA LAILTTAVSI AQSIDGFNKS AVLATFAKDT ATYGAIDQAL
SIAHSIDHPL QRRDVLLIIA ATAATHGQQE QALAILATAV SIAHSIDDPS HKSAVLETIA
KDTATYGAID HAVSITQSID SPYHKSKALA SIANAAATLG QREQALAILT TAVSIAHSID
HSYHKSAVLA TFANAAATYG AIDQALSIAH SIDGLINRSD ALATVAKGTA TYGAIDQALS
IAHSIDHPLQ RRDVLLIIAA TAATHGQREQ ALAILATAVS IAHSIDIPWQ KSDALATIAK
DTATYGAIDH AVSITHSIDD LFYKSDALFI IAATAATLGQ QEQALAILAT AVSIAQSIDS
LYYLSDALAT IVNAAATHGQ REQALAIAHS IDDPSHKSAV LATIAIAAAT HGQREQALAI
AHSIDDPYHK SDALTTIANA AVTHGQREQA LAILTTAASI AHSIDDPYYK SGALATIANA
AATLGQREQA LAILATAASI TQSIDDPYEK SDALATIANA AATLGQREQA LAILATAVSI
THSIDSPSHK STVLATIVNA ATHSAIDHAV SITQSIDSPY HKSKALASIA NAAATLGQRE
QALAILTTAV SITHSIDSPS HKSTVLATIA NAAATYGAID QALSIAHSID DPLQRHAVLA
TIAAADASQD AIERALSIAH SIDNLDHRAE TFRIILQKDL SVIDVLTSIQ HEWFRSKIPQ
DLWTMTPMIA PLLNDYPWLG TVILEEEAWV NEQLKRLG