Gene Haur_0045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0045 
Symbol 
ID5731917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp57811 
End bp60849 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content47% 
IMG OID641277166 
Producthypothetical protein 
Protein accessionYP_001542825 
Protein GI159896578 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGTT TCAGTAAAGT CCGCTTACTG GGAATATGTG TTTTGCTGCT AAGTTTGACG 
GCTTATCAGC GAACTCCCAA TAGTTCGGCT CAAGTGCTTC GCAGCCTTGT GCCAGTGCCC
GAAGTCGAAT CAAACGATAC ATGGCAAAAT GCCCATGATC TGACCAGCAG TTGTTTATGG
CCGAGCATTA CAATTTGTGA TGTAACGGGC GAGCTTGATG ATGGCGAGGA TTTAGATTGG
TACAAAATTA CGGTGCGGCC TGCAACCACC TTGTCAATTA GCCTCACCGA TACCTTTGGT
GATTATCAAG TTGCTTTATT TTATGATTTA GCCAAGCCAA TCACCCAAAC TGGCCATGTG
TCGCTGGGCA ACCTGAACGC GATTGGCAAC CTGAACGCGA TTGGCAACCT GAACGCGATT
GGCAACCTGA ACGCGATTGG CAACCTGAAC GCGATTGGCA ACCTGAACGC GATTGGCAAC
CTGAACGCGA TTGGCAACCT GAACGCGATT GGTTCAAATG GCTATTCCGA TGCCAGCATT
GATAGCTTTT TATGGACACC TGGTTTTTAT TATGTGCTGG TCTATCCGCT GGATGAAGAT
CAATCGGGTG ATTACAACTT GGCAATTGAA GCAACATTTG ATCAAAATTT GACCAAACCA
TTGCCAATTC CCTTTATGAT TAACCAGAGT GAGCCACCAG ATTGCAAACG CTCGCTGTAT
ATCACCAACA GCGATTACTT TAATATTTAT GAAGCATTCG CGCCTTATAA CACCAGTGTT
TATACCGAAT TACTTGGTTT AGCATCGAAA CCAGGCGATA CAAGCGGATT GGTGCTTGAT
TTAGCCAATA TTTCATGGAT TCAGCCAAGT AATGCATTTA GTGATCAATC AACGCAGTGG
GATAACAATC GTAATAATCC GTTCTATGCC AATCGGATTG CTGAAGGCGT GCATCATCTA
ATCACTCAAT ATGCGCGACG CTGTGCGAGC TTGCGCTATG TCACAATTGT TGGCGGCGAT
TACGTTGTGC CATTCTATCG CGTGCCCGAC GAAACCGTGA TTGCCAACGA AGGCGATTAT
TTGGCAAGTT CGGGAATTAA CACCAATAGC TTCACCGCCA GCAGCCTGAA ATACAAAACA
ATCTTGACTG ATAATATTTA TGGGGCAATC AAGCCAATTC CTTATCGTGG TCGCCAGTTA
TGGGTGCCTG AACTCAGCGT AGGCCGCTTA GTCGAAGGTG CTGATGCAAT TCATTTATAT
CTCAAGAGCA TGAACCTGCA TACCGATTTC CCGAGTGGCA GCATGATCAA TTTAAATCCT
CAACCGCAAC AAGCAAATTT GGTGACAGGT TATGATTTCT TGCTTGATCA AGCCAAGGTT
ATCTCGGGAA CATTGACAAA TTTAGTCGGT GCGCGTCCGG TTGGTTTGAT CAGTGATACC
TGGGATGCAG ATGATTTACT CTCAATCTGG TGGCCCGATG TCAATAATGG GATGTCCAAC
TCGCCCTATC CAGCTCAATC AATTAATGCC CACTTTACCC ACTATCAGGC GATTCCGGCC
AATTTTGAAA CCTCAACAGA TGTAGTCGAA GCAAAGGTTC TCTTTCAGCA TCCTTTTAGT
CCGGTCGCCT TAGAAGATGC TTATCAACGT AATCGTTTAG GCTATAGCGT TGGCTGCCAC
TCAGGCTATA GCGTCTACGA TAACGATGTT GCTGGTGGCC TTACAAACCC AGTTGCGATC
GATTTCCCTC AAGCGTATAT GAAGCAACAT GGAGCATGGA TTGGCAATAC TGGTTTTGGC
TATGGTGATA GTGATTTAGT CGGCTATTCG GAAAAACTCA TGGCTCAATT TACTCTCGAA
ATTGGCCGTG AGTGGAAAAA TCAATCAGGA TTTTATGGGG GCATGCCTAT TGGCTATGCC
TTAGTTCGGG CCAAACGCAG CTACTTACGC GAAGGCACAC CAGGCACATT TAGTGTCTAC
GATGAGAAAG TGCTGTTGCA ATCAACCTTG TATGGTTTGC CAATGATTCG GGTCTTGGTG
CCTAATCCAA TTCCCTTCCC CCCAAATGAT CGCCAAGATT TTGCCTGTTT GACCTGCCCA
CCATCAGTTG CCCATCGAGC AACTGTCACC GATCGGGCAA TTGATCGCGA TATTACCTTT
AACATTAACT ACACGACTGC CCAAGATAAT GCCCGTGGCA AAATCCTCAG GGCACAAGCA
ACCGTGGTCA ACGATACCGA AGGTTTGCTC ACGGGCCAGA TGTTTAATAG CTCGTTGGTC
AATGCGGCGG GTTTGCCATC ATTGCCTAAC TTTAGCATTC CACTGTCGCT GACCGATGGT
TTTGGGGAAC AACGGATTCA AGGGATTCAG TTCCTTGGTG GAACCACAAC CACCGCCGCA
ACCTTTAATC CGGTTATTAC GCGTTTGATT ACTGATGAGA TTTATTTGAG CCACGAACCA
ACCTTTGATT TTACTGATCG TTGGTATCCC GACCAACCAT TTAGCTATAG CGTGTTCGAA
AGCGATAATC TCGATGATGA TGGCCCGCTC AACCAAGATC GACGTTTCTC GCAGCTCTTG
CTAACTCCAA CCCAATTTAA AGGCAACACA TCGGGCGGTG AGTTACGCCA GTTCAGCACC
ATTACGCTAC GGCTGAAATA CTTGAGCGAT GGCGCTGATG ATGATTTGCT TGATGATACT
TTTGACCCAA TCGTTCGTGA TACCCAGCGC ACTAGCGACG GCAAAATTCG GGCAATTGTG
ATCGAGCGCG ATAACGATGG CCCGGGTGAT GAGCTTGATG CTGAGATGGT TGTCAAAACC
AACACTGGTG CTTGGCAAAC TGTCAGCATG AATACCTTCC AGATTGGCAC AACTGAACGC
TGGCAAATTG AAGGCAGCCT TCCAGCCAAT ACCTTTCTAC CCTTGCAACT GTTGATTCAA
GCGGAAGATG AGGCTGGTAA TGTCGGGGTT GAAACCTATG GCGGGCGTTT CGATATTGAT
TATGAAATGT ATCTGCCCAA TGTGAATGTC AACCGCTAG
 
Protein sequence
MRRFSKVRLL GICVLLLSLT AYQRTPNSSA QVLRSLVPVP EVESNDTWQN AHDLTSSCLW 
PSITICDVTG ELDDGEDLDW YKITVRPATT LSISLTDTFG DYQVALFYDL AKPITQTGHV
SLGNLNAIGN LNAIGNLNAI GNLNAIGNLN AIGNLNAIGN LNAIGNLNAI GSNGYSDASI
DSFLWTPGFY YVLVYPLDED QSGDYNLAIE ATFDQNLTKP LPIPFMINQS EPPDCKRSLY
ITNSDYFNIY EAFAPYNTSV YTELLGLASK PGDTSGLVLD LANISWIQPS NAFSDQSTQW
DNNRNNPFYA NRIAEGVHHL ITQYARRCAS LRYVTIVGGD YVVPFYRVPD ETVIANEGDY
LASSGINTNS FTASSLKYKT ILTDNIYGAI KPIPYRGRQL WVPELSVGRL VEGADAIHLY
LKSMNLHTDF PSGSMINLNP QPQQANLVTG YDFLLDQAKV ISGTLTNLVG ARPVGLISDT
WDADDLLSIW WPDVNNGMSN SPYPAQSINA HFTHYQAIPA NFETSTDVVE AKVLFQHPFS
PVALEDAYQR NRLGYSVGCH SGYSVYDNDV AGGLTNPVAI DFPQAYMKQH GAWIGNTGFG
YGDSDLVGYS EKLMAQFTLE IGREWKNQSG FYGGMPIGYA LVRAKRSYLR EGTPGTFSVY
DEKVLLQSTL YGLPMIRVLV PNPIPFPPND RQDFACLTCP PSVAHRATVT DRAIDRDITF
NINYTTAQDN ARGKILRAQA TVVNDTEGLL TGQMFNSSLV NAAGLPSLPN FSIPLSLTDG
FGEQRIQGIQ FLGGTTTTAA TFNPVITRLI TDEIYLSHEP TFDFTDRWYP DQPFSYSVFE
SDNLDDDGPL NQDRRFSQLL LTPTQFKGNT SGGELRQFST ITLRLKYLSD GADDDLLDDT
FDPIVRDTQR TSDGKIRAIV IERDNDGPGD ELDAEMVVKT NTGAWQTVSM NTFQIGTTER
WQIEGSLPAN TFLPLQLLIQ AEDEAGNVGV ETYGGRFDID YEMYLPNVNV NR