Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5145 |
Symbol | |
ID | 5737103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 197993 |
End bp | 203062 |
Gene Length | 5070 bp |
Protein Length | 1689 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641282310 |
Product | hypothetical protein |
Protein accession | YP_001547901 |
Protein GI | 159901655 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0775613 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGTG CGTTTGCATC CTTCCATCTC ACCATTGCCG CCCCCCACGG GGATCGCTAT CCTGTGACCG CCCGCACCCA GGCAGGCCAT GAGGTCAGCG AAGATCTGCT GCTGCCGCTT GATGATCCGA CCCTGACCGT CTACCAGATG GCGCTGGATT ATCACACACC GATTGACGAG TCCGTCGTGA TCGCGGTTGG CCAACTTCTG TACCAAACCC TGTTTCAAGG AACCATTGCC GAAGCCTTTG CCACGGCTCG CGCCCATGCT GACCAACAAA AGGTGGCGTT GCGTATCCAT TTGGCCATTG ATACCGATAC CCGCCTGAGC GCGGCTGCCG CTCTACCTTG GGAATTGATG GCCACCGCTG CGGGCCGTCC CCTCATGCTG GAACATGCCT TGGTACGAAC CTTTTCGTGG AATGATCCGA TTCCTAATTT GGGCATTCCC CCTGGTGAGC CTATTCGCCT CGCCGTGACC TCGGCCTTGC CTGCGGAGTT GGCAAACCAC CCGATTGCGG CGGAAGCCGA AGTTGCCATC ATCCATGCTG CCATCACGCA CAGCGCACGA CCAATCGATC TGATCGAAGT CCCCCATCTC ACCCGCGACC GCTTGACCGA TCTGCTCACC AACCAGCGAC CGCATATCGT GCATCATATC GGCCATGGCA GTATCCAACG CGGCATGGGC TACCTCGACA TCGAACGCGC CGATCAATCG CGGGATCGGC TCTCGGCCCG CGAGTTCAGC ACCATGCTCC ATCAGTCGGG GGTGCAACTG GTCGTCTTGA ATGCCTGCCA CACCAGCAGT GCTGGGGAGA GCCTGCTCAC GAGCTTTGCC CCGATTTTCA TCACCGATCG CATTCCCGCC GTGATTGGCA TGCAAGCCGC AATCCTGAAT CGGACTGGCC ACTGCTTTGC AAACGCCTTC TATGCCACTC TCGGACACAG TGGTTCAATT GATGCCAGCC TGATTGCGGC TCGCAAGGCG ATTCATGCCG ATGGCCATGA GCATGGGGCA TGGGGATTTA TAACCCTGTA TAGTCGGGTT ACGCATGGTG GGTTATGGCA GACGCAGCAT GCCGATCACA CCAACTCCAC CTCCCCACCA ACCGTGATTC AAAACACGAT AGAGCCGATC CAAGCCCACG GTTCTAATAT CTTGATCGGC AATCAGATTC AAGGCAATGT CTATCAACAT GTTGATCTTC CTGAAGCGAC ATTAAAAGCG TCACTTGCGG CACTCGAACA GGAAAAAACA CTGGCGAGGA TTCGCCATGC AGCACAACAA ATCGAGAGTG ATCGCCTGCT CACCCTGCGC CTTCAAGGCT TCGTCGGTCG GGTCAACGAA CTGGCGGCGA TTCGCGAGCA GATTGCAGCA ATGCGGCCTA CAGGTGGCTA TGTTTTGATC AAAGCGGCAG CAGGCGAAGG CAAGAGCAGT AGTATTGCCA AGTTGATTCA AGAAGCGGGG ATTGCACAGA CTCCGCACCA CTTTATTGCC CTGACCACGG GCCGCGAATA TCAATTGGGC TTGCTTCGGG CGGTGGTAGC ACAGTTGATT CTCAAGCATG GACTGACGGT TTCGTACTTC CCCGAGGAAA GCTATCCGGC GATGAAGGGG GAATTTACGC GAATCCTCGA CGAGCTTTCC AAGCAGGGCA TTCAGGAAAC GATCTATCTA GATGGCCTCG ATCAACTCCA ACCCGACATT GACGGCTCGC GTGACTTGTC GTTTCTGCCG CCACAGCCGC CTCCTGGCAT CGTGATGGTG CTTGGCTCAC GGCCTGATGA GACCTTGAAG CCGCTCGAAA TTCTGCATCG GGTGGACTAC GACCTGCCAC CACTGCGTGA AGATGATGCG CTCGCGTTGT GGCGATCGGT CCAGCCAGGT GTGTCGGATA GCCTCCTGCA TAACCTCTAT ACCGCACTCA AGGGCAATGC GCTGTTTGTC CACTTGGCGG CGGATACAAT GCAGGGTGCG TCCGTAGTCG ATGCGACCAG TTTGATTAAA CAGATTGAGC AGAATCCCAG CAATCTGTTT GGGATTACCT TGGAGCGGAT TAAAGGTCGA TCAATGTCTG ACTGGCGGTC GATTTGGAAG CCGATGCTGG CACTGTTGCT CGTTGCTCAA GAACCATTGC GGCTGGATGT GCTGGGCGAT CTGCTGGGGC ACGACCACGA CACGATGCAG GATGCCGTGT GGGTTTTTGG GGGATTAGTC AGTCAGGGCA TTGATCAGCG GGTTGCCCTG CATCACTTGT TGTTTCGCGA CTATTTGACG ACATCGGTGT TTAATGATCG TGAGGTGAAA CGCTGGCATC AACGACTAGC TGACTGGTGT GATAGCGATC TGGACGCGAT TTGGGCTGAT GATCGTGATC CCATTGAGCA GGCACGGCGG GTCTATGCGC GGCATCACTA TATCATGCAT TTATTCTTGG CGGAAAACTG GACAACACTC TGGAAGGTCT TGGATACGGG CGACTATGGT GAATACAAAA CCCGCTTCGA TCCGAGTACC CGACTCTATG CGCTGGATTT GGATCGCGGG CGAGAGAGTG CCATCAACGC GGGGCAATCG ACTAAAGAAA ATATCCAGAA CCTGCCTCGG CTGTGGAAGT ATAGTTTGTT GCGAACGAGT CTTAATAATA GGATGGATCA GAGTTCGGAT GAATTGTTTG TTATTTTAGC AATGCTTGGG CGGTTAGAGG AATCTTTAGC CCGTATTGAA TTAATATCTG ATCCAATCAG ACAAATCGGC TTATGGTCAA TCGTCGTCCA ATGGTGCGAT CCTCATCAGC AAAAAATACT TCTTTATCGA ATGGAATCTC TTCTCCCAAT GATTCCAGTA CAAAGTAAAC AAGAGGCGTT ACAAAGCATC ATACAGGTAT GCATTTGGAT TGGTGATCTT GATCAAGCAT ATAACCTTGC ACAGACAATC GACGATAATG AACAGCGGGC AAGCATCTTA TGCGACATTG CACAAGCCCA TCCTGAGTCT TATCCCACAG ATCAATTAGA CAACCTTTTA AATGAAGCAT TCTTGTTGGT AGAGTTTATC AACGATCCGA ATAGTGCTAG CCCTCTCTAC CATCGAATTG TCACGCTGTT GACTAATAGA TCTATGATCG TTCAAGCGGC AGCTCTTGCC GATAAAATTG ATAACCCTTG GCAACGAGCA AGAACACTAT ATGACATTGT GTGGCTTCTT GTACAAAAAC ATGAGTGTAT TCCTGCACTG ACTATAGCCT ATATGATTGA AGATTCGTCC TATCTGCTGC ATGCTATGAT TACTATTACG CTTGCATTTG CGGAGGCCGG TGATTCTGAA CAAGTTGAAA GGCTTCTATA TAAAATTCTT CAGGATACCG CTACCATCAG GCATATTGAA CAGTCTATCC AAGTTTATAG TGCTATTGCA GAAATATATA CAAAAGTGGG GAATGCCAAA CAGGCAAATA GTTATTTCCA TGCCATAGAA ACTTTGATTC ATTCCATAGA CGCGCCTGAA AAACAGGTGG ATGAACTTTG CTTCATGGCC AAAACGTACA ACCGAATGAA GATGGATATG TTGAGGGATA CATATCTTGA TAATGCCATA GCCCTTGCAC ACACGATAGA TGAGCCATCA GCGCAAGGGA ATGCCTTTAA AGCTATCAGT AAAGCCTATA CAGTTCTTGG GGATCTGAGC CGTGCAATAA CAATAAGCAC ACTGATTGCA GATTATGATA TACGTGAGAC TACGCTTGGC CAAGTTATTC AGATCGTTCT AAACAATCCA TCTAATGGCA ATTCACAAGC GGTCTTAAAT GAAGTTAGGA ATATCGCACA ATCGATCAAC CACTCATGGT GGCGAATAAA ATCGACTTAT TCGCTGATAT ATACCTATGT TAACAATGGC AATCTAGATC AGGCACACAA ACTACTGGGG TCAGAGTTCC AAACCCTATC TTATGCGGTT GATGAAGATT CTTTATCCAC ACTGGAAAAA ACACGGGTTT GTGTCTTATG TTCAATAGCT GCTGCCAGCT TTGCGATTCA GAATCTAGGA GTTTCCTATA CTCTGATGGA TGAAGCAATA ACCATAACGA AGCGTATTCT TGAACCTCGC ACACACATTC ATGCTGTAGA AACGATCGCA CAAACTTATG CTGTAATGAA TGATCATAAT CAGGCAACCA TATTTCTACA GCACGCATTG GAGATGGCGA AATCTATAGA CGATGGTGAT CTACAGTTCG AATTGATAGA TTCTATTGCA ACTATAAGTA TACGCATTGG GAATATTGAA CAAGCTATGA CGATTCTTGA ATCGGTCAAT ACTTATGATC GATTTGGTAG TTCATCTATA CTCCCAGGTA TTGCAATTGA ATTTGCCGAT AGAGGTCAAA TTGAACAAGC AATAGAATTT AGTCAATCTG TTAAATCGAG AGAGAAAGAT TTTGTATATC GTGCTATCGC TCAGGCATAT TGTACGGCTG GCGATCAAGA ACGGGCTAAA CATATTACAC AATCAATCAA GACAATGTGG AAATATATTG ATACCTTAAG TATCATTGCA AGAAGCTATG TTTCTGTCGA TAAGATAGAA CAAGCAAAAG ATCTTATTGC AGAAATGAAA ACACGTATAA CCATCCTACC CAACGACAAT GATCGTGATT ATGCATATGG ATCACTGTCA CAGGCACTAG CGACGATAGG ACAAATAACC CAGGCTATTG AAACTATTCA ATTAATCAAT ACCACAGGGA GCCGTGATGA AGCGATCTAT ACACTTGCGC AGGTATATGC GGCGCAGGGG AATATCACCC ATGCACTTGC TGAAGCGAAA TCGATTACCC ATATTAGAAG AAGGATAGCT TTGTTCTTAT CACTAGGCAA TGAGTATCGA GATGATGATT CATTAAAAAT ATCCCTTATT CAAAAAGAAT GGCAGGCCAG TAGAACAATC GAAGATACTT GGGAGCTATT ACAACTGATT AATCCCTTAT TAAACGACTA TCCATGGCTT GGAACAGTAA TTCTGGAAGA AGAGAAATGG GTCAATGCGC AACTCAAGCG ATTAGGGTAA
|
Protein sequence | MSSAFASFHL TIAAPHGDRY PVTARTQAGH EVSEDLLLPL DDPTLTVYQM ALDYHTPIDE SVVIAVGQLL YQTLFQGTIA EAFATARAHA DQQKVALRIH LAIDTDTRLS AAAALPWELM ATAAGRPLML EHALVRTFSW NDPIPNLGIP PGEPIRLAVT SALPAELANH PIAAEAEVAI IHAAITHSAR PIDLIEVPHL TRDRLTDLLT NQRPHIVHHI GHGSIQRGMG YLDIERADQS RDRLSAREFS TMLHQSGVQL VVLNACHTSS AGESLLTSFA PIFITDRIPA VIGMQAAILN RTGHCFANAF YATLGHSGSI DASLIAARKA IHADGHEHGA WGFITLYSRV THGGLWQTQH ADHTNSTSPP TVIQNTIEPI QAHGSNILIG NQIQGNVYQH VDLPEATLKA SLAALEQEKT LARIRHAAQQ IESDRLLTLR LQGFVGRVNE LAAIREQIAA MRPTGGYVLI KAAAGEGKSS SIAKLIQEAG IAQTPHHFIA LTTGREYQLG LLRAVVAQLI LKHGLTVSYF PEESYPAMKG EFTRILDELS KQGIQETIYL DGLDQLQPDI DGSRDLSFLP PQPPPGIVMV LGSRPDETLK PLEILHRVDY DLPPLREDDA LALWRSVQPG VSDSLLHNLY TALKGNALFV HLAADTMQGA SVVDATSLIK QIEQNPSNLF GITLERIKGR SMSDWRSIWK PMLALLLVAQ EPLRLDVLGD LLGHDHDTMQ DAVWVFGGLV SQGIDQRVAL HHLLFRDYLT TSVFNDREVK RWHQRLADWC DSDLDAIWAD DRDPIEQARR VYARHHYIMH LFLAENWTTL WKVLDTGDYG EYKTRFDPST RLYALDLDRG RESAINAGQS TKENIQNLPR LWKYSLLRTS LNNRMDQSSD ELFVILAMLG RLEESLARIE LISDPIRQIG LWSIVVQWCD PHQQKILLYR MESLLPMIPV QSKQEALQSI IQVCIWIGDL DQAYNLAQTI DDNEQRASIL CDIAQAHPES YPTDQLDNLL NEAFLLVEFI NDPNSASPLY HRIVTLLTNR SMIVQAAALA DKIDNPWQRA RTLYDIVWLL VQKHECIPAL TIAYMIEDSS YLLHAMITIT LAFAEAGDSE QVERLLYKIL QDTATIRHIE QSIQVYSAIA EIYTKVGNAK QANSYFHAIE TLIHSIDAPE KQVDELCFMA KTYNRMKMDM LRDTYLDNAI ALAHTIDEPS AQGNAFKAIS KAYTVLGDLS RAITISTLIA DYDIRETTLG QVIQIVLNNP SNGNSQAVLN EVRNIAQSIN HSWWRIKSTY SLIYTYVNNG NLDQAHKLLG SEFQTLSYAV DEDSLSTLEK TRVCVLCSIA AASFAIQNLG VSYTLMDEAI TITKRILEPR THIHAVETIA QTYAVMNDHN QATIFLQHAL EMAKSIDDGD LQFELIDSIA TISIRIGNIE QAMTILESVN TYDRFGSSSI LPGIAIEFAD RGQIEQAIEF SQSVKSREKD FVYRAIAQAY CTAGDQERAK HITQSIKTMW KYIDTLSIIA RSYVSVDKIE QAKDLIAEMK TRITILPNDN DRDYAYGSLS QALATIGQIT QAIETIQLIN TTGSRDEAIY TLAQVYAAQG NITHALAEAK SITHIRRRIA LFLSLGNEYR DDDSLKISLI QKEWQASRTI EDTWELLQLI NPLLNDYPWL GTVILEEEKW VNAQLKRLG
|
| |