Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0103 |
Symbol | |
ID | 5731996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 132307 |
End bp | 135936 |
Gene Length | 3630 bp |
Protein Length | 1209 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641277225 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_001542883 |
Protein GI | 159896636 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.572371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAAACG TTGTTTTATC GTTGCCTGTT TTTCATGCAA CCGAAGGGCT TGGCGCATTT CTCGGCGATT TGCGTTCATG GGTTTTTCGT TCAAAAAATC GGGCCGCCCG CCACTTTGGC TTGGCGCATA CCACAATTAT GCGCTATGAA AATGACCAAA TTTTGTGTCC GCTGGGCTAT ATCGCCGCCC TCGCGCAGTT GGTGATCGAG CAGTTGGATC TGCCGCCCTA TCAGCGTGGG TTGGCCGAGC AGCAATTATT AGCCACGATT CAATATGCAT TAACTGAGTA TGCGATTGAC CACACGCCGC TGGCAACGTG GCCTGAGTTA ACAAATTTGG CCGCAGCTTA CTTGGCTGAA GTGCAGCAGC AAAAGCAGGA GCAGGCTAAA ACTGGTCCAT TGATGGGCGT TTTGCACGAT TGGGGCGATG CGCCCGATGT ACAGAATTTT GTCGGTCGCG AGGACGAAAC TGCTACCCTT GTTAAATGGT TACAGCTCGA TCGTTGCCGT TTGGTGGCGA TTATTGGGTT GGGCGGTATG GGCAAAACCA GCCTTGCCAC CCGCGTTGCC CAACAAGCCC AAGATGATTT TAAGGTGATC GTCTGGCGTT CGCTGCAACA AGGTCAACAG GCCAATGATT TTCTGTTGGA ATGTTTGCAC CGGATTATGC CCAGTCCCAA TTCGGCCTAT CCAAGTCAAT TTGAGCACCG CCTAAGTGTG TTGATCGATT ATTTGCGTAC CACCCGCTGC TTGTTAATTC TCGACAATAT CGAGGCTATT TTGCAGCCGC AATATCCAGC TGGCCGCTAC CGCGAGGGCT ATGAGCAATA TGCCCAACTG TTTCAAGCAA TCAGCGAGCG TTCCCACGAA AGCTGTTTGA TTCTGACGAG CCGCGAAAAA CCCTATGAAT TCAATCGACT CGAAGGTGTG CATACCCGTT CGATGGTTTT GACGGGACTT ATGCGCGATG ATGCTCAAAT GTTGCTCGAT AATCAAGAGT TGTATGGCAC GCCGCAACTG TGGCAGGAGC TCATCAAGCA CTATACTGGC AATCCTTTGG CCTTAAAATT AGTTGCCCAA GTGATCAAAA CCATGTTTTT TGGCCAAATT GCTGAATTTT TGCAGCACGA AGAATTAATT TTTGGCGATG TGCGCACAAT TTTGGCTCAG CAATTTGAAC GTTTATCCGA CCAAGAGCAA GAATTATTGT ATTGGCTCGC AATCGAACGC CACACGGTCA AATTAGCTGA GCTTAAGCAT GACCTGGTGC GTTCAAAATA TCAACATATG CTGCTCGAAA CCCTTGAATC GTTGCTGCGC CGTTCGTTGG TGGAGCGGCA TCAAGATGGG TTTATGTTGC ATAATGTGGT GCTCGAATAT ACAACCGACC GTTTGATCGA CCAAATTGCC CAAGAATTAC TCGATGGGAC GCAGGGTTTG CTCTATCGCC ATGCCTTGAT CAAAGCCAAC AGCTTGGATA GCATTCGCGA ACACCAAAGT CGGGCAATTT TGCGGCCATT GCTGCACCGG ATTTTCGTTG AGCTTGGTCA AGAGCGTTTG CTGGCAAGCC TACGTCAATT ATTGCAAACC ATGCAGCCCT TGAGCGCATT GGAAATGGGC TATGCGCCAG GTAATATTTT TAATTTATTG GTTGAACTCA AGGCCGATTT GAGCCAATTT GATTTTCGGC ATAAACCGCT GTGGCACGCC AACCTACGTG GCATCAACCC CAAACAGCTT GATTTGAGCC ATAGCGACCT TTCGCGCAGT GTGTTTAGCG AACAATTTGG CGCATTGATT GCCCTAGCCC GCGATCCAGC TGATCGTTTT TTGGCCGTAG CAACCGCTGA TGATCAATTG ATTGTTTGGC AAAACTTAGA TCTGAAAAAA CTTTGGCAAG TACCCAGTAA CCACGATGGC ATTCGGGCAA TCTGTTTTAG CGGCGATGGG CGCTACTTGA TCAGCGCTGG TAACGATGGA CTCATTCGGC TGTGGGAGAC CAGTCAAGGC CAGAACCCAC GTATTTTAGC AGGCCATACC CGACCAGTAA TTGGCGTGGC GATTGCGCCC CAAAGTCAAC AATTAATCAG TGCCAGCCTT GATGGTGAGG TTCGCCTGTG GGATCGGCTC AGTGGCAAGT GTTTGCATCG TTTTAATGCG CATGCCGACG GTTTAAGCAG CATCGGGCTA AGTGCCAATG GTCAATACTT AGCAACGGCA GGGCTTGATC GTCAGATTAA ACTTTGGCAC GGCCCACAGT TAAATTATCA GACCACTATC ACGACCCATC ACGAGCCAAT CGAAATTCTA GCGTTTAGCC CCAATCCAAC GATTTTAGCG GGCACTGGAC TAGATGGCGA TGTCTACTTG TGGGATTTAC AAGCCAACCA GTTAATCACC AGTTTGCCCA ACGAAGATCG CGTGTTTGAT CTGCAATTTA GCCCTGATGG AGCTAATCTC GCCACCGCTG GCCTTGATCA ATGTATTCGG ATCTGGCAGG TAGAAACCGC TCACCTGACG CATATGCTCT ACGGCCATGC CCATTGGGTA CGGGCTTTAC ACTACAATCG CGATGGTTCA CGGCTCTACT CGGTCAGTAG CGATCAAAGT TTGCGCATCT GGGAGCAAGC TAGTGGCCGC TTGCTGCATA CACTCCAAGG CTATCGCGGC GGTGTACGCA GTTTGGCTTT GAGCAACAAC GCTGATTTAT TATTTAACGC TGGTGAGGCT CAAGCCGTCA CATTATGGCA ACTAGCCGAG CCATTCTATC GCCTGAATCT ACCGCAAGCA ACCAACAATG GCCGCGAATT AGCCTATCAT CAAGCCAGCC AACTACTGGC AATCAGCCAA GAGCAAGTAA TTCAACTCTG GGATTGCCAA CGTTTGCAAT TAGCAACGAT CTTGACTGGT CACCAAGCTT TGATTCGGGC GATCGCCTTT CGCCCTGATG GCAGCATGTT GGCAAGTTGC AGCGAAGATC ATACTGTTCA TGTTTGGTCG ATGCCTCATG GTCAGATTGT CCAAGTCTTT GGCTGTCACG ATGATTTAGT TACCACCCTC GCTTGGAGCC AGAACGGCAG TTTATTGGCA ACTGGCAGCG CTGATCGCAC GATTCGGATT TGGGGCGTAG CTGAACATAG TTGTTTAAGC TTGTTGGCGG GGCATAGCGC GGGCATTATC AGCCTAGCCT TCAGCCCCGA TCAACGCCAT TTGGTCAGTG CTGGAGCCGA TCAACAGGTG CGCATTTGGG ATCTGAGCAA CCAGTGCTAT GAAATTGTGC TCTTGCATAA ACCTGGCTTG CTCAAAGCGG TGCAATGGTC GGCTGATGGG CGCTGGATTG TGATTGCGGC TGGCTCGCTA GCCCTGATTT GGGATTGGCA AAACCAACAA TTGGTTCAGC GGTTTGAGCA TCAAGCAGCC GTCGATAGCA TCTGCCTCAG TAGCGATGGC CACATGCTGA TTACTGGCGA TCAACAAGGG GCAATCGCCA TCTGGCAACT CGCCACTGGT AAATTGCTCA AAAAACTGCA CAGCGATCGT CCTTATCAAG GGCTGATCAT CAACCAAGCC ATTGGTTTAA ATGCAGCCGA GCAGGCCAGC TTGCTCAATT TAGGCGCATT GATCAATTAA
|
Protein sequence | MTNVVLSLPV FHATEGLGAF LGDLRSWVFR SKNRAARHFG LAHTTIMRYE NDQILCPLGY IAALAQLVIE QLDLPPYQRG LAEQQLLATI QYALTEYAID HTPLATWPEL TNLAAAYLAE VQQQKQEQAK TGPLMGVLHD WGDAPDVQNF VGREDETATL VKWLQLDRCR LVAIIGLGGM GKTSLATRVA QQAQDDFKVI VWRSLQQGQQ ANDFLLECLH RIMPSPNSAY PSQFEHRLSV LIDYLRTTRC LLILDNIEAI LQPQYPAGRY REGYEQYAQL FQAISERSHE SCLILTSREK PYEFNRLEGV HTRSMVLTGL MRDDAQMLLD NQELYGTPQL WQELIKHYTG NPLALKLVAQ VIKTMFFGQI AEFLQHEELI FGDVRTILAQ QFERLSDQEQ ELLYWLAIER HTVKLAELKH DLVRSKYQHM LLETLESLLR RSLVERHQDG FMLHNVVLEY TTDRLIDQIA QELLDGTQGL LYRHALIKAN SLDSIREHQS RAILRPLLHR IFVELGQERL LASLRQLLQT MQPLSALEMG YAPGNIFNLL VELKADLSQF DFRHKPLWHA NLRGINPKQL DLSHSDLSRS VFSEQFGALI ALARDPADRF LAVATADDQL IVWQNLDLKK LWQVPSNHDG IRAICFSGDG RYLISAGNDG LIRLWETSQG QNPRILAGHT RPVIGVAIAP QSQQLISASL DGEVRLWDRL SGKCLHRFNA HADGLSSIGL SANGQYLATA GLDRQIKLWH GPQLNYQTTI TTHHEPIEIL AFSPNPTILA GTGLDGDVYL WDLQANQLIT SLPNEDRVFD LQFSPDGANL ATAGLDQCIR IWQVETAHLT HMLYGHAHWV RALHYNRDGS RLYSVSSDQS LRIWEQASGR LLHTLQGYRG GVRSLALSNN ADLLFNAGEA QAVTLWQLAE PFYRLNLPQA TNNGRELAYH QASQLLAISQ EQVIQLWDCQ RLQLATILTG HQALIRAIAF RPDGSMLASC SEDHTVHVWS MPHGQIVQVF GCHDDLVTTL AWSQNGSLLA TGSADRTIRI WGVAEHSCLS LLAGHSAGII SLAFSPDQRH LVSAGADQQV RIWDLSNQCY EIVLLHKPGL LKAVQWSADG RWIVIAAGSL ALIWDWQNQQ LVQRFEHQAA VDSICLSSDG HMLITGDQQG AIAIWQLATG KLLKKLHSDR PYQGLIINQA IGLNAAEQAS LLNLGALIN
|
| |