Gene Haur_5145 details

Gene Information

Locus tag: Haur_5145
Symbol:
ID: 5737103
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 197993
End bp: 203062
Gene Length: 5070 bp
Protein Length: 1689 aa
Translation table: 11
GC content: 48%
IMG OID: 641282310
Product: hypothetical protein
Protein accession: YP_001547901
Protein GI: 159901655
COG category:
COG ID:
TIGRFAM ID:
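
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions, although they agree with the values listed here. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(197993, 203062)
print(length_bp)                  # 5070
print(protein_length(length_bp))  # 1689
```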


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0775613
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGTG CGTTTGCATC CTTCCATCTC ACCATTGCCG CCCCCCACGG GGATCGCTAT 
CCTGTGACCG CCCGCACCCA GGCAGGCCAT GAGGTCAGCG AAGATCTGCT GCTGCCGCTT
GATGATCCGA CCCTGACCGT CTACCAGATG GCGCTGGATT ATCACACACC GATTGACGAG
TCCGTCGTGA TCGCGGTTGG CCAACTTCTG TACCAAACCC TGTTTCAAGG AACCATTGCC
GAAGCCTTTG CCACGGCTCG CGCCCATGCT GACCAACAAA AGGTGGCGTT GCGTATCCAT
TTGGCCATTG ATACCGATAC CCGCCTGAGC GCGGCTGCCG CTCTACCTTG GGAATTGATG
GCCACCGCTG CGGGCCGTCC CCTCATGCTG GAACATGCCT TGGTACGAAC CTTTTCGTGG
AATGATCCGA TTCCTAATTT GGGCATTCCC CCTGGTGAGC CTATTCGCCT CGCCGTGACC
TCGGCCTTGC CTGCGGAGTT GGCAAACCAC CCGATTGCGG CGGAAGCCGA AGTTGCCATC
ATCCATGCTG CCATCACGCA CAGCGCACGA CCAATCGATC TGATCGAAGT CCCCCATCTC
ACCCGCGACC GCTTGACCGA TCTGCTCACC AACCAGCGAC CGCATATCGT GCATCATATC
GGCCATGGCA GTATCCAACG CGGCATGGGC TACCTCGACA TCGAACGCGC CGATCAATCG
CGGGATCGGC TCTCGGCCCG CGAGTTCAGC ACCATGCTCC ATCAGTCGGG GGTGCAACTG
GTCGTCTTGA ATGCCTGCCA CACCAGCAGT GCTGGGGAGA GCCTGCTCAC GAGCTTTGCC
CCGATTTTCA TCACCGATCG CATTCCCGCC GTGATTGGCA TGCAAGCCGC AATCCTGAAT
CGGACTGGCC ACTGCTTTGC AAACGCCTTC TATGCCACTC TCGGACACAG TGGTTCAATT
GATGCCAGCC TGATTGCGGC TCGCAAGGCG ATTCATGCCG ATGGCCATGA GCATGGGGCA
TGGGGATTTA TAACCCTGTA TAGTCGGGTT ACGCATGGTG GGTTATGGCA GACGCAGCAT
GCCGATCACA CCAACTCCAC CTCCCCACCA ACCGTGATTC AAAACACGAT AGAGCCGATC
CAAGCCCACG GTTCTAATAT CTTGATCGGC AATCAGATTC AAGGCAATGT CTATCAACAT
GTTGATCTTC CTGAAGCGAC ATTAAAAGCG TCACTTGCGG CACTCGAACA GGAAAAAACA
CTGGCGAGGA TTCGCCATGC AGCACAACAA ATCGAGAGTG ATCGCCTGCT CACCCTGCGC
CTTCAAGGCT TCGTCGGTCG GGTCAACGAA CTGGCGGCGA TTCGCGAGCA GATTGCAGCA
ATGCGGCCTA CAGGTGGCTA TGTTTTGATC AAAGCGGCAG CAGGCGAAGG CAAGAGCAGT
AGTATTGCCA AGTTGATTCA AGAAGCGGGG ATTGCACAGA CTCCGCACCA CTTTATTGCC
CTGACCACGG GCCGCGAATA TCAATTGGGC TTGCTTCGGG CGGTGGTAGC ACAGTTGATT
CTCAAGCATG GACTGACGGT TTCGTACTTC CCCGAGGAAA GCTATCCGGC GATGAAGGGG
GAATTTACGC GAATCCTCGA CGAGCTTTCC AAGCAGGGCA TTCAGGAAAC GATCTATCTA
GATGGCCTCG ATCAACTCCA ACCCGACATT GACGGCTCGC GTGACTTGTC GTTTCTGCCG
CCACAGCCGC CTCCTGGCAT CGTGATGGTG CTTGGCTCAC GGCCTGATGA GACCTTGAAG
CCGCTCGAAA TTCTGCATCG GGTGGACTAC GACCTGCCAC CACTGCGTGA AGATGATGCG
CTCGCGTTGT GGCGATCGGT CCAGCCAGGT GTGTCGGATA GCCTCCTGCA TAACCTCTAT
ACCGCACTCA AGGGCAATGC GCTGTTTGTC CACTTGGCGG CGGATACAAT GCAGGGTGCG
TCCGTAGTCG ATGCGACCAG TTTGATTAAA CAGATTGAGC AGAATCCCAG CAATCTGTTT
GGGATTACCT TGGAGCGGAT TAAAGGTCGA TCAATGTCTG ACTGGCGGTC GATTTGGAAG
CCGATGCTGG CACTGTTGCT CGTTGCTCAA GAACCATTGC GGCTGGATGT GCTGGGCGAT
CTGCTGGGGC ACGACCACGA CACGATGCAG GATGCCGTGT GGGTTTTTGG GGGATTAGTC
AGTCAGGGCA TTGATCAGCG GGTTGCCCTG CATCACTTGT TGTTTCGCGA CTATTTGACG
ACATCGGTGT TTAATGATCG TGAGGTGAAA CGCTGGCATC AACGACTAGC TGACTGGTGT
GATAGCGATC TGGACGCGAT TTGGGCTGAT GATCGTGATC CCATTGAGCA GGCACGGCGG
GTCTATGCGC GGCATCACTA TATCATGCAT TTATTCTTGG CGGAAAACTG GACAACACTC
TGGAAGGTCT TGGATACGGG CGACTATGGT GAATACAAAA CCCGCTTCGA TCCGAGTACC
CGACTCTATG CGCTGGATTT GGATCGCGGG CGAGAGAGTG CCATCAACGC GGGGCAATCG
ACTAAAGAAA ATATCCAGAA CCTGCCTCGG CTGTGGAAGT ATAGTTTGTT GCGAACGAGT
CTTAATAATA GGATGGATCA GAGTTCGGAT GAATTGTTTG TTATTTTAGC AATGCTTGGG
CGGTTAGAGG AATCTTTAGC CCGTATTGAA TTAATATCTG ATCCAATCAG ACAAATCGGC
TTATGGTCAA TCGTCGTCCA ATGGTGCGAT CCTCATCAGC AAAAAATACT TCTTTATCGA
ATGGAATCTC TTCTCCCAAT GATTCCAGTA CAAAGTAAAC AAGAGGCGTT ACAAAGCATC
ATACAGGTAT GCATTTGGAT TGGTGATCTT GATCAAGCAT ATAACCTTGC ACAGACAATC
GACGATAATG AACAGCGGGC AAGCATCTTA TGCGACATTG CACAAGCCCA TCCTGAGTCT
TATCCCACAG ATCAATTAGA CAACCTTTTA AATGAAGCAT TCTTGTTGGT AGAGTTTATC
AACGATCCGA ATAGTGCTAG CCCTCTCTAC CATCGAATTG TCACGCTGTT GACTAATAGA
TCTATGATCG TTCAAGCGGC AGCTCTTGCC GATAAAATTG ATAACCCTTG GCAACGAGCA
AGAACACTAT ATGACATTGT GTGGCTTCTT GTACAAAAAC ATGAGTGTAT TCCTGCACTG
ACTATAGCCT ATATGATTGA AGATTCGTCC TATCTGCTGC ATGCTATGAT TACTATTACG
CTTGCATTTG CGGAGGCCGG TGATTCTGAA CAAGTTGAAA GGCTTCTATA TAAAATTCTT
CAGGATACCG CTACCATCAG GCATATTGAA CAGTCTATCC AAGTTTATAG TGCTATTGCA
GAAATATATA CAAAAGTGGG GAATGCCAAA CAGGCAAATA GTTATTTCCA TGCCATAGAA
ACTTTGATTC ATTCCATAGA CGCGCCTGAA AAACAGGTGG ATGAACTTTG CTTCATGGCC
AAAACGTACA ACCGAATGAA GATGGATATG TTGAGGGATA CATATCTTGA TAATGCCATA
GCCCTTGCAC ACACGATAGA TGAGCCATCA GCGCAAGGGA ATGCCTTTAA AGCTATCAGT
AAAGCCTATA CAGTTCTTGG GGATCTGAGC CGTGCAATAA CAATAAGCAC ACTGATTGCA
GATTATGATA TACGTGAGAC TACGCTTGGC CAAGTTATTC AGATCGTTCT AAACAATCCA
TCTAATGGCA ATTCACAAGC GGTCTTAAAT GAAGTTAGGA ATATCGCACA ATCGATCAAC
CACTCATGGT GGCGAATAAA ATCGACTTAT TCGCTGATAT ATACCTATGT TAACAATGGC
AATCTAGATC AGGCACACAA ACTACTGGGG TCAGAGTTCC AAACCCTATC TTATGCGGTT
GATGAAGATT CTTTATCCAC ACTGGAAAAA ACACGGGTTT GTGTCTTATG TTCAATAGCT
GCTGCCAGCT TTGCGATTCA GAATCTAGGA GTTTCCTATA CTCTGATGGA TGAAGCAATA
ACCATAACGA AGCGTATTCT TGAACCTCGC ACACACATTC ATGCTGTAGA AACGATCGCA
CAAACTTATG CTGTAATGAA TGATCATAAT CAGGCAACCA TATTTCTACA GCACGCATTG
GAGATGGCGA AATCTATAGA CGATGGTGAT CTACAGTTCG AATTGATAGA TTCTATTGCA
ACTATAAGTA TACGCATTGG GAATATTGAA CAAGCTATGA CGATTCTTGA ATCGGTCAAT
ACTTATGATC GATTTGGTAG TTCATCTATA CTCCCAGGTA TTGCAATTGA ATTTGCCGAT
AGAGGTCAAA TTGAACAAGC AATAGAATTT AGTCAATCTG TTAAATCGAG AGAGAAAGAT
TTTGTATATC GTGCTATCGC TCAGGCATAT TGTACGGCTG GCGATCAAGA ACGGGCTAAA
CATATTACAC AATCAATCAA GACAATGTGG AAATATATTG ATACCTTAAG TATCATTGCA
AGAAGCTATG TTTCTGTCGA TAAGATAGAA CAAGCAAAAG ATCTTATTGC AGAAATGAAA
ACACGTATAA CCATCCTACC CAACGACAAT GATCGTGATT ATGCATATGG ATCACTGTCA
CAGGCACTAG CGACGATAGG ACAAATAACC CAGGCTATTG AAACTATTCA ATTAATCAAT
ACCACAGGGA GCCGTGATGA AGCGATCTAT ACACTTGCGC AGGTATATGC GGCGCAGGGG
AATATCACCC ATGCACTTGC TGAAGCGAAA TCGATTACCC ATATTAGAAG AAGGATAGCT
TTGTTCTTAT CACTAGGCAA TGAGTATCGA GATGATGATT CATTAAAAAT ATCCCTTATT
CAAAAAGAAT GGCAGGCCAG TAGAACAATC GAAGATACTT GGGAGCTATT ACAACTGATT
AATCCCTTAT TAAACGACTA TCCATGGCTT GGAACAGTAA TTCTGGAAGA AGAGAAATGG
GTCAATGCGC AACTCAAGCG ATTAGGGTAA
 
Protein sequence
MSSAFASFHL TIAAPHGDRY PVTARTQAGH EVSEDLLLPL DDPTLTVYQM ALDYHTPIDE 
SVVIAVGQLL YQTLFQGTIA EAFATARAHA DQQKVALRIH LAIDTDTRLS AAAALPWELM
ATAAGRPLML EHALVRTFSW NDPIPNLGIP PGEPIRLAVT SALPAELANH PIAAEAEVAI
IHAAITHSAR PIDLIEVPHL TRDRLTDLLT NQRPHIVHHI GHGSIQRGMG YLDIERADQS
RDRLSAREFS TMLHQSGVQL VVLNACHTSS AGESLLTSFA PIFITDRIPA VIGMQAAILN
RTGHCFANAF YATLGHSGSI DASLIAARKA IHADGHEHGA WGFITLYSRV THGGLWQTQH
ADHTNSTSPP TVIQNTIEPI QAHGSNILIG NQIQGNVYQH VDLPEATLKA SLAALEQEKT
LARIRHAAQQ IESDRLLTLR LQGFVGRVNE LAAIREQIAA MRPTGGYVLI KAAAGEGKSS
SIAKLIQEAG IAQTPHHFIA LTTGREYQLG LLRAVVAQLI LKHGLTVSYF PEESYPAMKG
EFTRILDELS KQGIQETIYL DGLDQLQPDI DGSRDLSFLP PQPPPGIVMV LGSRPDETLK
PLEILHRVDY DLPPLREDDA LALWRSVQPG VSDSLLHNLY TALKGNALFV HLAADTMQGA
SVVDATSLIK QIEQNPSNLF GITLERIKGR SMSDWRSIWK PMLALLLVAQ EPLRLDVLGD
LLGHDHDTMQ DAVWVFGGLV SQGIDQRVAL HHLLFRDYLT TSVFNDREVK RWHQRLADWC
DSDLDAIWAD DRDPIEQARR VYARHHYIMH LFLAENWTTL WKVLDTGDYG EYKTRFDPST
RLYALDLDRG RESAINAGQS TKENIQNLPR LWKYSLLRTS LNNRMDQSSD ELFVILAMLG
RLEESLARIE LISDPIRQIG LWSIVVQWCD PHQQKILLYR MESLLPMIPV QSKQEALQSI
IQVCIWIGDL DQAYNLAQTI DDNEQRASIL CDIAQAHPES YPTDQLDNLL NEAFLLVEFI
NDPNSASPLY HRIVTLLTNR SMIVQAAALA DKIDNPWQRA RTLYDIVWLL VQKHECIPAL
TIAYMIEDSS YLLHAMITIT LAFAEAGDSE QVERLLYKIL QDTATIRHIE QSIQVYSAIA
EIYTKVGNAK QANSYFHAIE TLIHSIDAPE KQVDELCFMA KTYNRMKMDM LRDTYLDNAI
ALAHTIDEPS AQGNAFKAIS KAYTVLGDLS RAITISTLIA DYDIRETTLG QVIQIVLNNP
SNGNSQAVLN EVRNIAQSIN HSWWRIKSTY SLIYTYVNNG NLDQAHKLLG SEFQTLSYAV
DEDSLSTLEK TRVCVLCSIA AASFAIQNLG VSYTLMDEAI TITKRILEPR THIHAVETIA
QTYAVMNDHN QATIFLQHAL EMAKSIDDGD LQFELIDSIA TISIRIGNIE QAMTILESVN
TYDRFGSSSI LPGIAIEFAD RGQIEQAIEF SQSVKSREKD FVYRAIAQAY CTAGDQERAK
HITQSIKTMW KYIDTLSIIA RSYVSVDKIE QAKDLIAEMK TRITILPNDN DRDYAYGSLS
QALATIGQIT QAIETIQLIN TTGSRDEAIY TLAQVYAAQG NITHALAEAK SITHIRRRIA
LFLSLGNEYR DDDSLKISLI QKEWQASRTI EDTWELLQLI NPLLNDYPWL GTVILEEEKW
VNAQLKRLG
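
The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to join those blocks, compute GC content, and translate the gene sequence for comparison with the protein sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which this sketch does not model). The helper names `clean_blocks`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean_blocks(text: str) -> str:
    """Join a blocked sequence listing into one uppercase string."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
fragment = clean_blocks("ATGAGCAGTG CGTTTGCATC CTTCCATCTC")
print(translate(fragment))  # MSSAFASFHL
```

Applied to the full gene sequence, `translate` can be compared against the cleaned protein sequence, and `gc_content` against the GC content field of the record.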