Gene Haur_4484 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4484 
Symbol 
ID5736335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5738668 
End bp5742786 
Gene Length4119 bp 
Protein Length1372 aa 
Translation table11 
GC content51% 
IMG OID641281647 
Producthypothetical protein 
Protein accessionYP_001547244 
Protein GI159900997 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.517652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAATC GTAGTTCATG GCAATTTCGC AGTCAGTTGT TGGTGAGTAG CGGCCTTGTA 
GGGCTTATGG TGCTACTCAC AACGTTCTTA ACCACATCAT CATCGGCCCA AACCCGCGAG
TGTCGGATGG CTGATGCTTT GAAGATTTGT GCTAATACGT TTGTTGCAAC CGATCCCACA
CACTTTATTG CTCGCGATGA TGTAACGCTG GCGATTGGTG ATGCGCCACC ATTGATCAGC
GCTGGCGCGG TTGGCGCGAA CCTTGGCGAG TTTGTGTTCA CTGCCGATCA GTCGGTGCTG
TCGGGCGCAG TTAAATTTAT TGGCGATAAT GCTAGCTTGC CGCTAGTTGC CTCAACCTAT
AATGCCAACA ACACGCCGAA AGAGGTGTTT GAGGTTGATA CGACTGGCCT AACGATTACC
AATGATCAAA GCAGTGCTGA TCCAATCGGC GTAATCGCCA ATAGCACGAT CAGCCTGCAC
TTTCTTGATC GCTCGGGTGT GCGCAGTTTT TATAAGACGA CCGACCCCAG CGAGACCGAA
GATCTGAGCT TTGTGTTCGA TCTGGCCGCA GCAGAATTTC GGGCTGAGCT GCCGATTAAC
CTCAACATCG CAGCATTAAA CCCAAGCACG TCTGAGAATC CCAATCTCAA TATTGTGGTG
AATCTCAAAT ATTCGCAACA AGGCGTGCTC TCAGGCAATG TCGATAATTT CACCATGTCA
TTGGCTGGCA TGAGTGTTGC AGTCAAAGAG ATTGCGCTAT CAACTGGCTC TTTTGAAGCT
GGTTTGGTCG AAGTTTCACG AGCGGCTAAT CCCGATTTAC CAAACCTTGA TCCGGCTAAG
CCAAATTTGG TCTTTTCGCT GCAAAATCTC AAGTATGCGA ACCGCAGCTT TAGCATTGGC
GGCGGCTCGG TGCCAATTCC CGATTGGAAA TTTGGCGGCA ACTTTAGCAT GACCGACCAA
ACCTTGAGCA TTGGCCATGA TAATGCGACT GGTACGTCAA CCGTTAGCGT CAACTCAACC
TTGATTTTTG GCAATGTGCT GAACCAACCA GAATCAGCCA CGCCGCGCGA AGTAACCCTG
ACATTTAGTG CCGTCAAAGT CAACGGTGTT TTCAAGCCTG TTTTCAGTGC TACTGTTGCT
GAAACCACCG TTGCAATGGG GCCGTTGAAT TTTCGCCTGC GCGGGTTGAG CTTAATCGGC
GATACAGTAG AAAATTTCTA TGGCTTGAAA GCAAGTGCGG TTGATCTGCT GTGGTCAAGC
GATATGGGCG GTCAATCGGC AGCAGGTATT ACTGGCTTCA AATTTGGCAT CAATAAAGAT
GGTAAGTTGC AATTTGCCCT CGGCGGTGCG ACGATCAGCA CTCCTAATAT TTCGAGCGGC
GTGTTGGAAG GCAGCAGCAT CGTTGGTCAG TTTGGCGTGG CCCAACAAAC CCTCAACCTA
ACCGTGACTG GTAACCTGAG CGTAAAAATC CCAGGTAACA GCGGTGTTGG TGCAGGCTTG
GTGATGGTCG TGCGTGGCGG CCCGAATGTT GCGCCGATTG GCAGCCAAAA CTGTGCCCAA
TCACCATGTA TCAAACGTTT CGAAGCCTCG CTAACCAACT TTAGCGTCAA AATTGCTGGT
TTCAGCATGG CCTTGCAGAA TCCCCGCTTC CTTGATGATG GTGGTTTTGC CGCCGATAGT
GCTCGACTCT CGATGTCAGA CCTGATGGGC AACCTTACCG CCGATGTGTC TGGCCTCAGC
ATCAGTGGGC GCGGCGAAGT TTCGGTAACT GGCGGCGGGA TTGAGTTGCC ACCGCTCAAG
ATTGCAGGAA CTAATTTTGT TGGTTTCCGT GGCTTCTTCA GCAAAGATGG CGCTGGCTAC
ATGTTTGCTG GTGGGGCAAC TCTGAGCATG CCTGGCTTTG ATCCTAGCGG CGGCTCAACG
ATTTCGGTTG ATGTTTCGGT CAAAACCTTG CCAACCGGGG TGTTTAATGA GTTGGATGTC
GTGGTGGCCT TCGAGTCATC GCCAGGCATT CCGTTGGCCA ACAGTGGCGC TGCCCTGACC
AAGATGAGTG GTTCGTTCTC GCTCAAGTCA GGCTCGGTCA CGATTGGGGT TGGCATTGAA
GTAAGTTCAG TTGCTCAGCT CGCAGGAATT CCGCTAGTTT CGGCAGAGGG AACTGCAACC
TTGGTGGTTG ATCCATTCAA GTTCTCGCTG ACCGCCAGCA TGAAGGTATT GATTTTTGAA
GTTGCTAGCG CTAGTGTCGA AATTGGCCAC GAAGCTGGGT TTAGCGGCGG CAAGGGTCTG
CATGCTAAGT TCCAATTTGA AGCAGTGATT GTACGTGGCG GCTTGGAACT GCGGGTTGGC
ACGGTCACGG TTCGTAGCTG TACGCCTGCT GGCTCAACCA ATTGTGTTGA TAAACAAAAA
CTACGCTTTG CTGGCTCAGC ACGAATGTCG GTTGGCTTGC GCAAAGGCCA GTTCGGCAAA
GCCTTACCAC CAAAGAATAT TACCTTTGGC TCAGTTTCAT TCCAAATGGG CGAGTTCGAA
AAATCGGGTG GTGGTACGAC CGTAGGGATG TTGGGCCGCG TGAGCTGCTG TTTCGGCATC
TTCAAAGTGA GTGTATTTGT TGATTTGAGC AAGCCGGTTG GATTGAACAC TGGTTTTGTC
AAGCTCGTCA ACCCCAAAAA TTATCGCCTG ATTAACTCGC TCCAAATTGC GCAGAGTATC
GAGCAAGGCG AACCAGGCTA CAGCCAACGA ATCATCTCAC GGCCTGTCAA CCCAGGGAAA
TCGGGTGGTG CCTTGTTTGC GGCAATCCCT GAAGTTACAG TACCAGTGGT GATTACCTCG
ACCGCCAGTG GCTACTTTGG GATTCACTTT AGTGGAACTC CTTCAGTTGA ACCAGTTATC
CGCTTGATTC TGCCTGATGG AACGGAACTC AATGAAGGCA ATGTCAATGA CACAACTCAG
ACCTTGATTC GCGATTACAC CACCGTGATT ACGGAAGGCA ATGACTTGGC CTTTATGCTC
GAAGCGGCCA CACCTGGTAC GTATAGTTTG ATTATTGAAG GGCCACCAAG CCAATACGAA
GTGGTTGCTT ACCAACTGAA TAACCCACCA ATTTTCGATA GCGCAACCTT GGCTTGTGGC
GGTGCGGCAA CCCCAGGCGT AACCGTAACA TGTAACTCGG CTCCAACTGG CAGCAAAGTT
ACGGTTAATT GGGCGACCCG CGATACCGAT GACCCTAACG CCAAAGTTTC GCTGGTCTAT
GCTAGCGTGA TCACCCCAAC CGATCCCATT GATGTAGGCT TGAGCACGGT CATTAGCGAT
AACATCAAAC TTGGTACAGG CCAGCATGTT TGGGATCTAA GCGAGATTCC AAGTGGTCAA
TATAAGCTGG CGCTCTTCGC CGACGATGGC CATAATCAAC CAACCGTTCA ACAATTGGAT
ACCTTGATTG TGGTCAATGA TCAGCGTGCG CCCAAAATTC CAACCAATCT TCAAGCGACT
CCATTGCCCG GTCAACTGTT GGTCAAGTGG ACACCGAACA GCGAAATGGA CCTTGGTGGC
TATGAGATTG GCTTTGGTGA AGTCAATGAT CCAAATGAGT TCCTCTACTC CCGCAATATG
GGTGGCAAAG AGATGATCTT TACTGCGACG AACCAACTTG ATGCCAAACT GTGGGGCTTG
AAAGACAATC AATCGATCTT CTATGGGATT CGAGCCTATG ATCTTAGTGG CAACTTCAGC
GCTTGGTCAC CGTTAGTTGT GGGCACGCCG TGGTCGCTAA GCCCACATGC TTGGAATCCA
GTGCCTGGAG GACGCGGTGT GACAACCACC AAGATTGAGG CCGCCTTTGA AACTCCACTG
AGCGAGGCAT CGCTGACGAA TGCCTTCCAA GTTCGCAATG CCAGCAATCA GTTGGTAGCC
GGAACACCAA TCTATCTCTA CAATCTTGAT AAAACTGAGA TTATCGGATT TAGCTTCAAG
CCTAGCGCTA CGCTGGTCGA TGGTGAAACC TACACCGTGA CCATTCGTGG CGGAGCCAAC
GGAATTAGAT CGAAAGATGG CCGTCAAATG CCGGCTGACT TCAGTTGGAA ATTCGAGGTC
GAATCGTATC AAATCTATCT GCCAGCCGTG AAGCGCTAA
 
Protein sequence
MINRSSWQFR SQLLVSSGLV GLMVLLTTFL TTSSSAQTRE CRMADALKIC ANTFVATDPT 
HFIARDDVTL AIGDAPPLIS AGAVGANLGE FVFTADQSVL SGAVKFIGDN ASLPLVASTY
NANNTPKEVF EVDTTGLTIT NDQSSADPIG VIANSTISLH FLDRSGVRSF YKTTDPSETE
DLSFVFDLAA AEFRAELPIN LNIAALNPST SENPNLNIVV NLKYSQQGVL SGNVDNFTMS
LAGMSVAVKE IALSTGSFEA GLVEVSRAAN PDLPNLDPAK PNLVFSLQNL KYANRSFSIG
GGSVPIPDWK FGGNFSMTDQ TLSIGHDNAT GTSTVSVNST LIFGNVLNQP ESATPREVTL
TFSAVKVNGV FKPVFSATVA ETTVAMGPLN FRLRGLSLIG DTVENFYGLK ASAVDLLWSS
DMGGQSAAGI TGFKFGINKD GKLQFALGGA TISTPNISSG VLEGSSIVGQ FGVAQQTLNL
TVTGNLSVKI PGNSGVGAGL VMVVRGGPNV APIGSQNCAQ SPCIKRFEAS LTNFSVKIAG
FSMALQNPRF LDDGGFAADS ARLSMSDLMG NLTADVSGLS ISGRGEVSVT GGGIELPPLK
IAGTNFVGFR GFFSKDGAGY MFAGGATLSM PGFDPSGGST ISVDVSVKTL PTGVFNELDV
VVAFESSPGI PLANSGAALT KMSGSFSLKS GSVTIGVGIE VSSVAQLAGI PLVSAEGTAT
LVVDPFKFSL TASMKVLIFE VASASVEIGH EAGFSGGKGL HAKFQFEAVI VRGGLELRVG
TVTVRSCTPA GSTNCVDKQK LRFAGSARMS VGLRKGQFGK ALPPKNITFG SVSFQMGEFE
KSGGGTTVGM LGRVSCCFGI FKVSVFVDLS KPVGLNTGFV KLVNPKNYRL INSLQIAQSI
EQGEPGYSQR IISRPVNPGK SGGALFAAIP EVTVPVVITS TASGYFGIHF SGTPSVEPVI
RLILPDGTEL NEGNVNDTTQ TLIRDYTTVI TEGNDLAFML EAATPGTYSL IIEGPPSQYE
VVAYQLNNPP IFDSATLACG GAATPGVTVT CNSAPTGSKV TVNWATRDTD DPNAKVSLVY
ASVITPTDPI DVGLSTVISD NIKLGTGQHV WDLSEIPSGQ YKLALFADDG HNQPTVQQLD
TLIVVNDQRA PKIPTNLQAT PLPGQLLVKW TPNSEMDLGG YEIGFGEVND PNEFLYSRNM
GGKEMIFTAT NQLDAKLWGL KDNQSIFYGI RAYDLSGNFS AWSPLVVGTP WSLSPHAWNP
VPGGRGVTTT KIEAAFETPL SEASLTNAFQ VRNASNQLVA GTPIYLYNLD KTEIIGFSFK
PSATLVDGET YTVTIRGGAN GIRSKDGRQM PADFSWKFEV ESYQIYLPAV KR