Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0045 |
Symbol | |
ID | 5731917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 57811 |
End bp | 60849 |
Gene Length | 3039 bp |
Protein Length | 1012 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641277166 |
Product | hypothetical protein |
Protein accession | YP_001542825 |
Protein GI | 159896578 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGTT TCAGTAAAGT CCGCTTACTG GGAATATGTG TTTTGCTGCT AAGTTTGACG GCTTATCAGC GAACTCCCAA TAGTTCGGCT CAAGTGCTTC GCAGCCTTGT GCCAGTGCCC GAAGTCGAAT CAAACGATAC ATGGCAAAAT GCCCATGATC TGACCAGCAG TTGTTTATGG CCGAGCATTA CAATTTGTGA TGTAACGGGC GAGCTTGATG ATGGCGAGGA TTTAGATTGG TACAAAATTA CGGTGCGGCC TGCAACCACC TTGTCAATTA GCCTCACCGA TACCTTTGGT GATTATCAAG TTGCTTTATT TTATGATTTA GCCAAGCCAA TCACCCAAAC TGGCCATGTG TCGCTGGGCA ACCTGAACGC GATTGGCAAC CTGAACGCGA TTGGCAACCT GAACGCGATT GGCAACCTGA ACGCGATTGG CAACCTGAAC GCGATTGGCA ACCTGAACGC GATTGGCAAC CTGAACGCGA TTGGCAACCT GAACGCGATT GGTTCAAATG GCTATTCCGA TGCCAGCATT GATAGCTTTT TATGGACACC TGGTTTTTAT TATGTGCTGG TCTATCCGCT GGATGAAGAT CAATCGGGTG ATTACAACTT GGCAATTGAA GCAACATTTG ATCAAAATTT GACCAAACCA TTGCCAATTC CCTTTATGAT TAACCAGAGT GAGCCACCAG ATTGCAAACG CTCGCTGTAT ATCACCAACA GCGATTACTT TAATATTTAT GAAGCATTCG CGCCTTATAA CACCAGTGTT TATACCGAAT TACTTGGTTT AGCATCGAAA CCAGGCGATA CAAGCGGATT GGTGCTTGAT TTAGCCAATA TTTCATGGAT TCAGCCAAGT AATGCATTTA GTGATCAATC AACGCAGTGG GATAACAATC GTAATAATCC GTTCTATGCC AATCGGATTG CTGAAGGCGT GCATCATCTA ATCACTCAAT ATGCGCGACG CTGTGCGAGC TTGCGCTATG TCACAATTGT TGGCGGCGAT TACGTTGTGC CATTCTATCG CGTGCCCGAC GAAACCGTGA TTGCCAACGA AGGCGATTAT TTGGCAAGTT CGGGAATTAA CACCAATAGC TTCACCGCCA GCAGCCTGAA ATACAAAACA ATCTTGACTG ATAATATTTA TGGGGCAATC AAGCCAATTC CTTATCGTGG TCGCCAGTTA TGGGTGCCTG AACTCAGCGT AGGCCGCTTA GTCGAAGGTG CTGATGCAAT TCATTTATAT CTCAAGAGCA TGAACCTGCA TACCGATTTC CCGAGTGGCA GCATGATCAA TTTAAATCCT CAACCGCAAC AAGCAAATTT GGTGACAGGT TATGATTTCT TGCTTGATCA AGCCAAGGTT ATCTCGGGAA CATTGACAAA TTTAGTCGGT GCGCGTCCGG TTGGTTTGAT CAGTGATACC TGGGATGCAG ATGATTTACT CTCAATCTGG TGGCCCGATG TCAATAATGG GATGTCCAAC TCGCCCTATC CAGCTCAATC AATTAATGCC CACTTTACCC ACTATCAGGC GATTCCGGCC AATTTTGAAA CCTCAACAGA TGTAGTCGAA GCAAAGGTTC TCTTTCAGCA TCCTTTTAGT CCGGTCGCCT TAGAAGATGC TTATCAACGT AATCGTTTAG GCTATAGCGT TGGCTGCCAC TCAGGCTATA GCGTCTACGA TAACGATGTT GCTGGTGGCC TTACAAACCC AGTTGCGATC GATTTCCCTC AAGCGTATAT GAAGCAACAT GGAGCATGGA TTGGCAATAC TGGTTTTGGC TATGGTGATA GTGATTTAGT CGGCTATTCG GAAAAACTCA TGGCTCAATT TACTCTCGAA ATTGGCCGTG AGTGGAAAAA TCAATCAGGA TTTTATGGGG GCATGCCTAT TGGCTATGCC TTAGTTCGGG CCAAACGCAG CTACTTACGC GAAGGCACAC CAGGCACATT TAGTGTCTAC GATGAGAAAG TGCTGTTGCA ATCAACCTTG TATGGTTTGC CAATGATTCG GGTCTTGGTG CCTAATCCAA TTCCCTTCCC CCCAAATGAT CGCCAAGATT TTGCCTGTTT GACCTGCCCA CCATCAGTTG CCCATCGAGC AACTGTCACC GATCGGGCAA TTGATCGCGA TATTACCTTT AACATTAACT ACACGACTGC CCAAGATAAT GCCCGTGGCA AAATCCTCAG GGCACAAGCA ACCGTGGTCA ACGATACCGA AGGTTTGCTC ACGGGCCAGA TGTTTAATAG CTCGTTGGTC AATGCGGCGG GTTTGCCATC ATTGCCTAAC TTTAGCATTC CACTGTCGCT GACCGATGGT TTTGGGGAAC AACGGATTCA AGGGATTCAG TTCCTTGGTG GAACCACAAC CACCGCCGCA ACCTTTAATC CGGTTATTAC GCGTTTGATT ACTGATGAGA TTTATTTGAG CCACGAACCA ACCTTTGATT TTACTGATCG TTGGTATCCC GACCAACCAT TTAGCTATAG CGTGTTCGAA AGCGATAATC TCGATGATGA TGGCCCGCTC AACCAAGATC GACGTTTCTC GCAGCTCTTG CTAACTCCAA CCCAATTTAA AGGCAACACA TCGGGCGGTG AGTTACGCCA GTTCAGCACC ATTACGCTAC GGCTGAAATA CTTGAGCGAT GGCGCTGATG ATGATTTGCT TGATGATACT TTTGACCCAA TCGTTCGTGA TACCCAGCGC ACTAGCGACG GCAAAATTCG GGCAATTGTG ATCGAGCGCG ATAACGATGG CCCGGGTGAT GAGCTTGATG CTGAGATGGT TGTCAAAACC AACACTGGTG CTTGGCAAAC TGTCAGCATG AATACCTTCC AGATTGGCAC AACTGAACGC TGGCAAATTG AAGGCAGCCT TCCAGCCAAT ACCTTTCTAC CCTTGCAACT GTTGATTCAA GCGGAAGATG AGGCTGGTAA TGTCGGGGTT GAAACCTATG GCGGGCGTTT CGATATTGAT TATGAAATGT ATCTGCCCAA TGTGAATGTC AACCGCTAG
|
Protein sequence | MRRFSKVRLL GICVLLLSLT AYQRTPNSSA QVLRSLVPVP EVESNDTWQN AHDLTSSCLW PSITICDVTG ELDDGEDLDW YKITVRPATT LSISLTDTFG DYQVALFYDL AKPITQTGHV SLGNLNAIGN LNAIGNLNAI GNLNAIGNLN AIGNLNAIGN LNAIGNLNAI GSNGYSDASI DSFLWTPGFY YVLVYPLDED QSGDYNLAIE ATFDQNLTKP LPIPFMINQS EPPDCKRSLY ITNSDYFNIY EAFAPYNTSV YTELLGLASK PGDTSGLVLD LANISWIQPS NAFSDQSTQW DNNRNNPFYA NRIAEGVHHL ITQYARRCAS LRYVTIVGGD YVVPFYRVPD ETVIANEGDY LASSGINTNS FTASSLKYKT ILTDNIYGAI KPIPYRGRQL WVPELSVGRL VEGADAIHLY LKSMNLHTDF PSGSMINLNP QPQQANLVTG YDFLLDQAKV ISGTLTNLVG ARPVGLISDT WDADDLLSIW WPDVNNGMSN SPYPAQSINA HFTHYQAIPA NFETSTDVVE AKVLFQHPFS PVALEDAYQR NRLGYSVGCH SGYSVYDNDV AGGLTNPVAI DFPQAYMKQH GAWIGNTGFG YGDSDLVGYS EKLMAQFTLE IGREWKNQSG FYGGMPIGYA LVRAKRSYLR EGTPGTFSVY DEKVLLQSTL YGLPMIRVLV PNPIPFPPND RQDFACLTCP PSVAHRATVT DRAIDRDITF NINYTTAQDN ARGKILRAQA TVVNDTEGLL TGQMFNSSLV NAAGLPSLPN FSIPLSLTDG FGEQRIQGIQ FLGGTTTTAA TFNPVITRLI TDEIYLSHEP TFDFTDRWYP DQPFSYSVFE SDNLDDDGPL NQDRRFSQLL LTPTQFKGNT SGGELRQFST ITLRLKYLSD GADDDLLDDT FDPIVRDTQR TSDGKIRAIV IERDNDGPGD ELDAEMVVKT NTGAWQTVSM NTFQIGTTER WQIEGSLPAN TFLPLQLLIQ AEDEAGNVGV ETYGGRFDID YEMYLPNVNV NR
|
| |