Gene Haur_2132 details


Gene Information

Locus tag: Haur_2132
Symbol:
ID: 5734020
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2676715
End bp: 2681007
Gene Length: 4293 bp
Protein Length: 1430 aa
Translation table: 11
GC content: 50%
IMG OID: 641279273
Product: hypothetical protein
Protein accession: YP_001544900
Protein GI: 159898653
COG category:
COG ID:
TIGRFAM ID:
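
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the gene is a stop codon (the gene sequence on this page ends in TAA). The variable names are illustrative and are not part of the record.

```python
# Coordinates and lengths copied from the Gene Information fields.
start_bp = 2676715
end_bp = 2681007
reported_gene_length = 4293      # bp
reported_protein_length = 1430   # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, gene_length == reported_gene_length)
print(protein_length, protein_length == reported_protein_length)
```

Under these assumptions both reported lengths agree with the coordinates.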


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.285692
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACAATC AGATTTTGCA GGAATCCGCT AACCAGCGGG AAAGTGCTCG TTTGTTGCAG 
TTGCGGCTCA CGGGCTTCGT TGGGCGTGAA GCCGAACAGG CCGCGATTCG CGGATTAATC
GACCAAACCC GCCCAAGCGG TGGCTACGTG TTGGTGACGG GCGAGGCTGG AGCGGGTAAA
AGTAGCCTGA TCGCCCAATT AATCGTGAGC GCTGGGCTGG CCCAAACCCC GCAGCATTTT
ATTGCGCTGA CCCCAGGCCG TGCCTATCAA CTCGACCTGC TGCGTAGCAT CGTTGCCCAG
CTCATGCTCA AACACGATCT GGTAATCAAC TATTTTCCTG CCGATAGCTA CCCCGCTTTA
CGCCTTGAAT TTGGCCAGTT GTTACAAACG CTCTCAGCGC GTGGCATCAG CGAAACGATC
TATCTCGATG GCTTGGATCA ATTGCAGCCT GAGGTGGATG GAACCCGCGA TCTCAGTTTT
TTGCCCTTAC AGCTGCCGTC TGGCATCGTG ATGGTGCTTG GCTCACGACC TAACGAGACG
ATCGCTAGTT TAGCGCTTGA ACACGGGGTC GTTTATCAGG TTCCACCGTT GCGCGAGCAG
GATGCGGTTG GGCGTTGGCA GCAGGTCCAG CCGACGTTGG AGCCAGTGAG GTTGCATGGT
TTAGCGCAAG CCGTCAAGGG CAATGCCTTG TTGGTCGAAC TAGCGGCCAA TGTACTGCGC
CACACCACAT CAGCGGAATT GCTGCCATTG CTCGACCACG CCAGCGCTGA CGCGACCAAT
CTCTTCCGGC TGAGCCTTGG ACGGATCGAA CAAGCAGCAC CGCACCACTG GCAATCGCTG
ATTCGCCCAT TGTTGGCAGT GTTATTGGTG ACCCAAGAAT CGCTTGAACT AGCGGTGCTG
GCGGCGATTC TCGAACAACC AACTGCTACA GTGGGCGAGG CGCTGGCCCT GATGAGCGAT
TGGGTGAGTG TTGCCGCCGA TCAGCGCGTG GCTTTGCGCC ATTTGTTGTT TCACGATTTT
CTGATCGACC ACGAATTTGC GGCAGCGGAA CGCCGTGGGT GGCATGGACG GATGGCTGCG
TGGTGTGGCG CAGCGCTTGA CCAGATTTGG CACGATAGTA CAGAATCCGT TGAACAGGCA
CGGCGCTGGT ATGCGCGACA GCATTACATC ACCCATTTGG ATTGCGCAGA ACAATGGGAA
GCATTGTGGC AAGTGATCGA TGCGGGCGAG TATGGCGAGC AGAAAGTGCG CTTTGAGCCA
AGCACCCGCT TGTATGGCTT GGATTTGGAT CGTGCTCGCG AGAGCGTGAT TGCTGCTGGC
CAAAGCCTTG AGCAGCAGCT TGAACTCTTG CCGCGCTTGT GGCGCTATAG TCTACTACGC
ACCAGCCTCA CGGCCCATGC CGATCGTTGG CGTGACTATC ATTTTGTCAT GTTGGCGATG
CTTGGGCGGG TATCTGAGGC GCTAGCCCAG CTCGATATTT GTTCGAATCA AGTATCCCAA
GTGCGAATCT GGTCGCAACT ATTGCCCTAT CTAGAATCTG ATGTACGCTG GCGAATCTTC
CAGCGCATGG AGCAAACAGC CCGTAGCATC CCCGATCCGC GTCGTCGCGA TTATGTGTTG
CATCTCGTTG CTGTGGCCTA TGCCGACTAT GACTTGCTGA AAATGGCCTA TCCAATTGCG
ATTAGCCTTG GTGATAGTCG TGATGAAACG CTGGCACATT TGATTGACGT GGCGATTAAA
CAGCATGATT TAGCCTATGC TCAAGCCATA ATCGGGTATG TTCAAGCACC GGCAGCACAG
ATTAAGCATG CCTTGAATGT AACCAATGCC TTGATTGAAG GCGCAGAATT TGAGGCAGCC
CGACACGTAT TGGCCGACAT TATGCCCTTG GCCCAAGCGG AGCATATTGT TGAAATTAAT
AGTTTGATTG CGTTGATTGA ATGGCGATTG GGTAACCAGC AACAATCCCA AACATTGCTG
GATGAAGCCC AAACCATGAG CGCACACTTA AACCCCGATC TGCGATCTGA TGCGTGGTTG
GCGGTGATTA AGAGTTATGT ACATCAAGGC GATTTGCTCA AGGCTGCAAG TTTGCATCGA
AAAATTCAAT CAGTACAAGT CTGGTGTGAT CTGATCATGT TTTATCGTGA TCGCGCTGAA
GTAACAATAG CGACTGAGTT GGCTATTGCT ATGACCAATG CCCACTTTCG TGATTCGGCC
TGTTTTGCGC TGGTTGAATG GTATTGTACC CATGCTGAAT TTGCGGCTGC CCAACGCCTG
ATTGAATTAA TCGGTTTCGA TTGGGACAAA GTTAAAGCGT ATTGTTTGCT GGCAAGTAGT
TATGCTGAAA ATTTGCAATT TGAACAAATG CTGGCAGTAA TGCAGCTGGC ACACCATACC
GTCCCGCGCG AGCAGCAACA TTCGATTCCA AGTTTGTTAG TCATTGCCGA TACCTATGCC
CGCCACAATC TGCATGAGCA AGCCCGTAGT GTGTTTGAAC AAATATTAGC ACTCTTTTTG
AGCGGCCAAA GGTATAATCG GGATGAACAT AACCTACACT TTGTGCAAAG TACCCAGCGC
TATGGCTATC TCGATCTTTC CGAGCCATTG ATTCAACGCT TATTTACGCT AGGAGATTAT
GGGTTTAATA ATGATCTCGT TGAGCGGATT GCGAAAGTCT ATGCTGAACA AGGCGAACTT
GCCCAAGCAA CCCAGATTAT TCAATCAATT GGTCAAGATT ATCAGTTTAT CCAAGCGGCT
CAAGGGCTGG TGTTGAGCAC AACCAACCAG CAATCTGCTG CTAGCGAATC GTTATTGCTT
GCGGCGCGGC AACGGGTTGC CCAAATTAAC TCCGATGGGC CGTCTGCCCT AAAAACCCTT
TGCGAGCTAG CTGATACAGC GTTGCAGCTT GGATTAACCG CCATCGCCCA AACGTTGCTG
AGCGACGTTC ATCAATCTCT ACTACGCAAG CCACATCTCC TACATCATCC CTTGCTCCAA
TATGAAGCGT GGCTGCTCAA AAGCTATCAA GCCCAACATA AACTAGCCGA TCTGATAGAG
CTTGCACGGT TAATCGACGA TCCGCAAGCC CATGATCGGT GGATTGGCGC AATTCTTGAA
GCCTATCTCC AGGCTGATGA TGTTGGACAA GCCTATCAGC TGCTTCGGTT ATTTAATGAT
TTTGCGGATG TCTATGCTAA AAGTGCTTGT AAACTTGCGA TCAAAGCCAG CCAACTAGGG
TTAAATGAGC TTGCAACCCA AGTGCATCCT GAAGCACTCT CAGCCTGTGA GACCGTTCGA
GAGTCGCGTT ATCGCATTGA GTATCTCAGA GATCTTGCGG TTGCCCAAAT TAACTATGGT
GATGCAGGCT GTTTAGCCCG CTTATTGCAG ATTTTTCGCG AGCAGGAAGC AGTATTCGGC
CAGATCGATT GGTATATTGA AGCGCTTTGC GCAATTGCGG TGGCCTTTGC CGAACAAGGC
GATGCTGTAG CGTTTGCTGA TTGGCTAAAC TATGCACATA AGCGGGCTAC GGCGTTTCCC
ACTGGCTACC AAGCGCTTGC TGAAACGTAT TTTGGCTATA CTCCAGATTC GGCTATCAAG
GCATTTTTGG ATTCTATTGA GCAGCTGGTG CAAATAAGCC TAGATCAGGG CTATGCAGAT
ACTGCTCTTG AGGCGTTGGC TAAAATATAT ACAAGCTATG CAGCCCATGG CCATGCCGAG
TTTTTGGTCA AAGCCCATCA AACCGCAATT AGTATTCCTA ATCGTGACTA TCAATATGCT
GCACTTGTAA CCGTTGCCCA AGCGTATCTG AAGATCAAAG TTATGCCAGA GCTACAAGTG
ATTATTAGTG AACTGAGCCA ACTTGGCTTT GATTTTTGGA TATTTCAGGA TCTTACTGCA
AGTTGCATTG AAGAAGGCGA GCTTCATTTG GCATACCAGT TGATTCTGTT CGATGAGCGT
CATCCAGTCA AGGATGAGGT CATTTGTAAC TTGATTGCCC GCTTAATCCA AGCCGATCAG
CCGATAATTG CTTATCAGTT AACAAGCGAG ATCTACGAGG CTGATAATCG GGCTGGCAGT
TTGCAACAAA TCATTCATTA TTATCTTGAG CGTCAGCAAA TCACTGATGT TATCAAGATT
ATTCAAACCA CATGGCGCGA CTGCCAAAGG AGTTACGAAT TATGGCAATT AAGTACAATC
ATTGTGCCGT TGATTCCCCA CTACCCATGG CTTGGCACTG CCGTGCTCGA TAGCGTGCCC
TGGCTTGAAC AGCAATTAAC TCGCTTGAAT TAA
 
Protein sequence
MHNQILQESA NQRESARLLQ LRLTGFVGRE AEQAAIRGLI DQTRPSGGYV LVTGEAGAGK 
SSLIAQLIVS AGLAQTPQHF IALTPGRAYQ LDLLRSIVAQ LMLKHDLVIN YFPADSYPAL
RLEFGQLLQT LSARGISETI YLDGLDQLQP EVDGTRDLSF LPLQLPSGIV MVLGSRPNET
IASLALEHGV VYQVPPLREQ DAVGRWQQVQ PTLEPVRLHG LAQAVKGNAL LVELAANVLR
HTTSAELLPL LDHASADATN LFRLSLGRIE QAAPHHWQSL IRPLLAVLLV TQESLELAVL
AAILEQPTAT VGEALALMSD WVSVAADQRV ALRHLLFHDF LIDHEFAAAE RRGWHGRMAA
WCGAALDQIW HDSTESVEQA RRWYARQHYI THLDCAEQWE ALWQVIDAGE YGEQKVRFEP
STRLYGLDLD RARESVIAAG QSLEQQLELL PRLWRYSLLR TSLTAHADRW RDYHFVMLAM
LGRVSEALAQ LDICSNQVSQ VRIWSQLLPY LESDVRWRIF QRMEQTARSI PDPRRRDYVL
HLVAVAYADY DLLKMAYPIA ISLGDSRDET LAHLIDVAIK QHDLAYAQAI IGYVQAPAAQ
IKHALNVTNA LIEGAEFEAA RHVLADIMPL AQAEHIVEIN SLIALIEWRL GNQQQSQTLL
DEAQTMSAHL NPDLRSDAWL AVIKSYVHQG DLLKAASLHR KIQSVQVWCD LIMFYRDRAE
VTIATELAIA MTNAHFRDSA CFALVEWYCT HAEFAAAQRL IELIGFDWDK VKAYCLLASS
YAENLQFEQM LAVMQLAHHT VPREQQHSIP SLLVIADTYA RHNLHEQARS VFEQILALFL
SGQRYNRDEH NLHFVQSTQR YGYLDLSEPL IQRLFTLGDY GFNNDLVERI AKVYAEQGEL
AQATQIIQSI GQDYQFIQAA QGLVLSTTNQ QSAASESLLL AARQRVAQIN SDGPSALKTL
CELADTALQL GLTAIAQTLL SDVHQSLLRK PHLLHHPLLQ YEAWLLKSYQ AQHKLADLIE
LARLIDDPQA HDRWIGAILE AYLQADDVGQ AYQLLRLFND FADVYAKSAC KLAIKASQLG
LNELATQVHP EALSACETVR ESRYRIEYLR DLAVAQINYG DAGCLARLLQ IFREQEAVFG
QIDWYIEALC AIAVAFAEQG DAVAFADWLN YAHKRATAFP TGYQALAETY FGYTPDSAIK
AFLDSIEQLV QISLDQGYAD TALEALAKIY TSYAAHGHAE FLVKAHQTAI SIPNRDYQYA
ALVTVAQAYL KIKVMPELQV IISELSQLGF DFWIFQDLTA SCIEEGELHL AYQLILFDER
HPVKDEVICN LIARLIQADQ PIIAYQLTSE IYEADNRAGS LQQIIHYYLE RQQITDVIKI
IQTTWRDCQR SYELWQLSTI IVPLIPHYPW LGTAVLDSVP WLEQQLTRLN
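
The gene and protein sequences above are printed in 10-character blocks, so they need to be joined before any analysis. The sketch below shows one way to do that in Python, then computes a GC fraction and translates the DNA. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and only the first line of each sequence is used as a check.

```python
# Codon table built from the usual TCAG ordering of the genetic code.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence and of the protein sequence from this page.
first_dna_line = "ATGCACAATC AGATTTTGCA GGAATCCGCT AACCAGCGGG AAAGTGCTCG TTTGTTGCAG"
first_protein_line = "MHNQILQESA NQRESARLLQ"

dna = clean(first_dna_line)
print(translate(dna) == clean(first_protein_line))  # True
print(round(gc_fraction(dna), 2))
```

To check the whole record, paste the full gene sequence into `clean` and compare the translation with the full protein sequence; the GC fraction of the full gene can then be compared with the reported GC content.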