Gene Haur_0103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0103 
Symbol 
ID5731996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp132307 
End bp135936 
Gene Length3630 bp 
Protein Length1209 aa 
Translation table11 
GC content50% 
IMG OID641277225 
ProductWD-40 repeat-containing protein 
Protein accessionYP_001542883 
Protein GI159896636 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.572371 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAACG TTGTTTTATC GTTGCCTGTT TTTCATGCAA CCGAAGGGCT TGGCGCATTT 
CTCGGCGATT TGCGTTCATG GGTTTTTCGT TCAAAAAATC GGGCCGCCCG CCACTTTGGC
TTGGCGCATA CCACAATTAT GCGCTATGAA AATGACCAAA TTTTGTGTCC GCTGGGCTAT
ATCGCCGCCC TCGCGCAGTT GGTGATCGAG CAGTTGGATC TGCCGCCCTA TCAGCGTGGG
TTGGCCGAGC AGCAATTATT AGCCACGATT CAATATGCAT TAACTGAGTA TGCGATTGAC
CACACGCCGC TGGCAACGTG GCCTGAGTTA ACAAATTTGG CCGCAGCTTA CTTGGCTGAA
GTGCAGCAGC AAAAGCAGGA GCAGGCTAAA ACTGGTCCAT TGATGGGCGT TTTGCACGAT
TGGGGCGATG CGCCCGATGT ACAGAATTTT GTCGGTCGCG AGGACGAAAC TGCTACCCTT
GTTAAATGGT TACAGCTCGA TCGTTGCCGT TTGGTGGCGA TTATTGGGTT GGGCGGTATG
GGCAAAACCA GCCTTGCCAC CCGCGTTGCC CAACAAGCCC AAGATGATTT TAAGGTGATC
GTCTGGCGTT CGCTGCAACA AGGTCAACAG GCCAATGATT TTCTGTTGGA ATGTTTGCAC
CGGATTATGC CCAGTCCCAA TTCGGCCTAT CCAAGTCAAT TTGAGCACCG CCTAAGTGTG
TTGATCGATT ATTTGCGTAC CACCCGCTGC TTGTTAATTC TCGACAATAT CGAGGCTATT
TTGCAGCCGC AATATCCAGC TGGCCGCTAC CGCGAGGGCT ATGAGCAATA TGCCCAACTG
TTTCAAGCAA TCAGCGAGCG TTCCCACGAA AGCTGTTTGA TTCTGACGAG CCGCGAAAAA
CCCTATGAAT TCAATCGACT CGAAGGTGTG CATACCCGTT CGATGGTTTT GACGGGACTT
ATGCGCGATG ATGCTCAAAT GTTGCTCGAT AATCAAGAGT TGTATGGCAC GCCGCAACTG
TGGCAGGAGC TCATCAAGCA CTATACTGGC AATCCTTTGG CCTTAAAATT AGTTGCCCAA
GTGATCAAAA CCATGTTTTT TGGCCAAATT GCTGAATTTT TGCAGCACGA AGAATTAATT
TTTGGCGATG TGCGCACAAT TTTGGCTCAG CAATTTGAAC GTTTATCCGA CCAAGAGCAA
GAATTATTGT ATTGGCTCGC AATCGAACGC CACACGGTCA AATTAGCTGA GCTTAAGCAT
GACCTGGTGC GTTCAAAATA TCAACATATG CTGCTCGAAA CCCTTGAATC GTTGCTGCGC
CGTTCGTTGG TGGAGCGGCA TCAAGATGGG TTTATGTTGC ATAATGTGGT GCTCGAATAT
ACAACCGACC GTTTGATCGA CCAAATTGCC CAAGAATTAC TCGATGGGAC GCAGGGTTTG
CTCTATCGCC ATGCCTTGAT CAAAGCCAAC AGCTTGGATA GCATTCGCGA ACACCAAAGT
CGGGCAATTT TGCGGCCATT GCTGCACCGG ATTTTCGTTG AGCTTGGTCA AGAGCGTTTG
CTGGCAAGCC TACGTCAATT ATTGCAAACC ATGCAGCCCT TGAGCGCATT GGAAATGGGC
TATGCGCCAG GTAATATTTT TAATTTATTG GTTGAACTCA AGGCCGATTT GAGCCAATTT
GATTTTCGGC ATAAACCGCT GTGGCACGCC AACCTACGTG GCATCAACCC CAAACAGCTT
GATTTGAGCC ATAGCGACCT TTCGCGCAGT GTGTTTAGCG AACAATTTGG CGCATTGATT
GCCCTAGCCC GCGATCCAGC TGATCGTTTT TTGGCCGTAG CAACCGCTGA TGATCAATTG
ATTGTTTGGC AAAACTTAGA TCTGAAAAAA CTTTGGCAAG TACCCAGTAA CCACGATGGC
ATTCGGGCAA TCTGTTTTAG CGGCGATGGG CGCTACTTGA TCAGCGCTGG TAACGATGGA
CTCATTCGGC TGTGGGAGAC CAGTCAAGGC CAGAACCCAC GTATTTTAGC AGGCCATACC
CGACCAGTAA TTGGCGTGGC GATTGCGCCC CAAAGTCAAC AATTAATCAG TGCCAGCCTT
GATGGTGAGG TTCGCCTGTG GGATCGGCTC AGTGGCAAGT GTTTGCATCG TTTTAATGCG
CATGCCGACG GTTTAAGCAG CATCGGGCTA AGTGCCAATG GTCAATACTT AGCAACGGCA
GGGCTTGATC GTCAGATTAA ACTTTGGCAC GGCCCACAGT TAAATTATCA GACCACTATC
ACGACCCATC ACGAGCCAAT CGAAATTCTA GCGTTTAGCC CCAATCCAAC GATTTTAGCG
GGCACTGGAC TAGATGGCGA TGTCTACTTG TGGGATTTAC AAGCCAACCA GTTAATCACC
AGTTTGCCCA ACGAAGATCG CGTGTTTGAT CTGCAATTTA GCCCTGATGG AGCTAATCTC
GCCACCGCTG GCCTTGATCA ATGTATTCGG ATCTGGCAGG TAGAAACCGC TCACCTGACG
CATATGCTCT ACGGCCATGC CCATTGGGTA CGGGCTTTAC ACTACAATCG CGATGGTTCA
CGGCTCTACT CGGTCAGTAG CGATCAAAGT TTGCGCATCT GGGAGCAAGC TAGTGGCCGC
TTGCTGCATA CACTCCAAGG CTATCGCGGC GGTGTACGCA GTTTGGCTTT GAGCAACAAC
GCTGATTTAT TATTTAACGC TGGTGAGGCT CAAGCCGTCA CATTATGGCA ACTAGCCGAG
CCATTCTATC GCCTGAATCT ACCGCAAGCA ACCAACAATG GCCGCGAATT AGCCTATCAT
CAAGCCAGCC AACTACTGGC AATCAGCCAA GAGCAAGTAA TTCAACTCTG GGATTGCCAA
CGTTTGCAAT TAGCAACGAT CTTGACTGGT CACCAAGCTT TGATTCGGGC GATCGCCTTT
CGCCCTGATG GCAGCATGTT GGCAAGTTGC AGCGAAGATC ATACTGTTCA TGTTTGGTCG
ATGCCTCATG GTCAGATTGT CCAAGTCTTT GGCTGTCACG ATGATTTAGT TACCACCCTC
GCTTGGAGCC AGAACGGCAG TTTATTGGCA ACTGGCAGCG CTGATCGCAC GATTCGGATT
TGGGGCGTAG CTGAACATAG TTGTTTAAGC TTGTTGGCGG GGCATAGCGC GGGCATTATC
AGCCTAGCCT TCAGCCCCGA TCAACGCCAT TTGGTCAGTG CTGGAGCCGA TCAACAGGTG
CGCATTTGGG ATCTGAGCAA CCAGTGCTAT GAAATTGTGC TCTTGCATAA ACCTGGCTTG
CTCAAAGCGG TGCAATGGTC GGCTGATGGG CGCTGGATTG TGATTGCGGC TGGCTCGCTA
GCCCTGATTT GGGATTGGCA AAACCAACAA TTGGTTCAGC GGTTTGAGCA TCAAGCAGCC
GTCGATAGCA TCTGCCTCAG TAGCGATGGC CACATGCTGA TTACTGGCGA TCAACAAGGG
GCAATCGCCA TCTGGCAACT CGCCACTGGT AAATTGCTCA AAAAACTGCA CAGCGATCGT
CCTTATCAAG GGCTGATCAT CAACCAAGCC ATTGGTTTAA ATGCAGCCGA GCAGGCCAGC
TTGCTCAATT TAGGCGCATT GATCAATTAA
 
Protein sequence
MTNVVLSLPV FHATEGLGAF LGDLRSWVFR SKNRAARHFG LAHTTIMRYE NDQILCPLGY 
IAALAQLVIE QLDLPPYQRG LAEQQLLATI QYALTEYAID HTPLATWPEL TNLAAAYLAE
VQQQKQEQAK TGPLMGVLHD WGDAPDVQNF VGREDETATL VKWLQLDRCR LVAIIGLGGM
GKTSLATRVA QQAQDDFKVI VWRSLQQGQQ ANDFLLECLH RIMPSPNSAY PSQFEHRLSV
LIDYLRTTRC LLILDNIEAI LQPQYPAGRY REGYEQYAQL FQAISERSHE SCLILTSREK
PYEFNRLEGV HTRSMVLTGL MRDDAQMLLD NQELYGTPQL WQELIKHYTG NPLALKLVAQ
VIKTMFFGQI AEFLQHEELI FGDVRTILAQ QFERLSDQEQ ELLYWLAIER HTVKLAELKH
DLVRSKYQHM LLETLESLLR RSLVERHQDG FMLHNVVLEY TTDRLIDQIA QELLDGTQGL
LYRHALIKAN SLDSIREHQS RAILRPLLHR IFVELGQERL LASLRQLLQT MQPLSALEMG
YAPGNIFNLL VELKADLSQF DFRHKPLWHA NLRGINPKQL DLSHSDLSRS VFSEQFGALI
ALARDPADRF LAVATADDQL IVWQNLDLKK LWQVPSNHDG IRAICFSGDG RYLISAGNDG
LIRLWETSQG QNPRILAGHT RPVIGVAIAP QSQQLISASL DGEVRLWDRL SGKCLHRFNA
HADGLSSIGL SANGQYLATA GLDRQIKLWH GPQLNYQTTI TTHHEPIEIL AFSPNPTILA
GTGLDGDVYL WDLQANQLIT SLPNEDRVFD LQFSPDGANL ATAGLDQCIR IWQVETAHLT
HMLYGHAHWV RALHYNRDGS RLYSVSSDQS LRIWEQASGR LLHTLQGYRG GVRSLALSNN
ADLLFNAGEA QAVTLWQLAE PFYRLNLPQA TNNGRELAYH QASQLLAISQ EQVIQLWDCQ
RLQLATILTG HQALIRAIAF RPDGSMLASC SEDHTVHVWS MPHGQIVQVF GCHDDLVTTL
AWSQNGSLLA TGSADRTIRI WGVAEHSCLS LLAGHSAGII SLAFSPDQRH LVSAGADQQV
RIWDLSNQCY EIVLLHKPGL LKAVQWSADG RWIVIAAGSL ALIWDWQNQQ LVQRFEHQAA
VDSICLSSDG HMLITGDQQG AIAIWQLATG KLLKKLHSDR PYQGLIINQA IGLNAAEQAS
LLNLGALIN