Gene Information |
Locus tag | Haur_5274 |
Symbol | |
ID | 5737232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 59182 |
End bp | 60741 |
Gene Length | 1560 bp |
Protein Length | 519 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641282438 |
Product | hypothetical protein |
Protein accession | YP_001548029 |
Protein GI | 159901784 |
COG category | |
COG ID | |
TIGRFAM ID | |
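
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span includes the terminating stop codon. Both assumptions agree with the numbers in this record, but the record does not state them. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that ends with a
    stop codon (assumptions, not stated in the record itself).
    """
    gene_length = end_bp - start_bp + 1       # inclusive coordinates
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1     # last codon is the stop codon
    return gene_length, protein_length


# Values copied from the record: Start bp 59182, End bp 60741
gene_bp, protein_aa = cds_lengths(59182, 60741)
print(gene_bp, protein_aa)  # 1560 519
```

The result matches the listed Gene Length (1560 bp) and Protein Length (519 aa).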
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGACTAC ACCATTGCAC TATTGGACAC TATCGTGTTC TTCGCAATCT TCAGATTAAA TTCAATAACA CGGATAACGA ACGTCTAGGG ATTGATTTTC TAGTAGGCCA GAATGGCAGC GGCAAATCAA CTCTGCTTCA AGCTATTACT TTTATAATCC AGCAATTAGA AAAGGGAAGC GATGTGCCCT TTCGCTTCTA TCTAGAGTAC GAACTTGGCT CAGCGAATGC CAAGCAGAGT ATCCGCATTT ATAACTATGA ATTGGATGAA GATTCTAACC AAAAAAAGCT GGACAGGGCA AGAGCAGACA CGCGTACAGT AAATACTGAA TGGCAGGAGC GTCATGGCTC ATTGGAAGAA ATCCTACCAC GAGTAATAAT CCTTACAACA GGCAGAGAGC AGGAATGGCA AACACTTTTG AAACCATCTC AGCAGAGCGT TACGCTTGAT CCGCTTGAAG TTCTATCGGG ATCACTAGCA GATTCCGATA TTCACGCTCA GCATGAGCAA CGACTACTTA GTGAGCGGGT AGGGGCATCA ATTGTTTTTG AACCAAGTGC TCAATCAGAA TCACAGAAAA TCACGTTTGT CCCAACCGAA GCATTGCCAC TTGTAACACT ATGCGGGGTT TTAGCAGAAC TTTCGCATAA TGGCATACTT AACTCAATGA AAGATATTTT CGAAGATGTT CGCATTCGGC AAGTATGTGC GTTCTCTCTC CGCTTTCGAC TCAACCTTGC TGGCTCGAAT GAGCGACGTG AAATTCAGGA GCTTGGGCGA AAGGCTACGC GTGCGGTGCA TATAGGTTCT GACTGGCTAC TCTATTTCGA CTTAACAGAT CAGCAACAAT CGAGTATCCA AGATTTGTTG GGCGACCGGG GTGGAGCTTT CGCTTTCTAC CAACAACTAC GGCGGTTGAG TAGGAATGCA ATTGCAGCAG AGCGGGTGCT TCAAGAGGTT AATATCTTTG TTAAACGTGG CCCGAAGGAC GAAGATCCAG TTAAAGATCG CGAGATTGCT CAGAATGCAC CATTGCACCC ACTTGCTTGG TTGAGTGATG GGGAACGTAG CTTCATAGGC CGGATGAGTC TATTCAGTAT GCTCCGCGAT CAAGACCAAC TTATTCTGCT TGATGAACCT GAAGTTCATT TTAATGACTA CTGGAAACGC CAGATAGTTG ATCGTTTAGC AACATTGCTC GAAGATGGCA AGTGTCATGC CCTGATTACA TCTCACTCCA GCATCACGCT GACTGATGTT CCACGTGAGG ATATTATTGT GTTGCATCGT GGAGAGCAAT ATACTCAGAG TGCTGGGAGT CCTACGCTTA AAACACTGGC TGCTGATCCG AGTGATATTA TTATTCATGT TTTCGATTCG CCTTATGCGA CAGGCCAATA TGCTGTTAAG AAAGTTAAAG CGATATTGGA TGAAGTTGGC CAGCGTAATA ACAAGCAGGC ACAGCAAAAG CTTAAAGACT TGCTTAACGA AGTTGGGCCT GGGTATTGGA GCTATCGAAT CCGGCGGGTA CTGTATAGGA TGAGTAACGA TGCTTCATAA
Protein sequence | MRLHHCTIGH YRVLRNLQIK FNNTDNERLG IDFLVGQNGS GKSTLLQAIT FIIQQLEKGS DVPFRFYLEY ELGSANAKQS IRIYNYELDE DSNQKKLDRA RADTRTVNTE WQERHGSLEE ILPRVIILTT GREQEWQTLL KPSQQSVTLD PLEVLSGSLA DSDIHAQHEQ RLLSERVGAS IVFEPSAQSE SQKITFVPTE ALPLVTLCGV LAELSHNGIL NSMKDIFEDV RIRQVCAFSL RFRLNLAGSN ERREIQELGR KATRAVHIGS DWLLYFDLTD QQQSSIQDLL GDRGGAFAFY QQLRRLSRNA IAAERVLQEV NIFVKRGPKD EDPVKDREIA QNAPLHPLAW LSDGERSFIG RMSLFSMLRD QDQLILLDEP EVHFNDYWKR QIVDRLATLL EDGKCHALIT SHSSITLTDV PREDIIVLHR GEQYTQSAGS PTLKTLAADP SDIIIHVFDS PYATGQYAVK KVKAILDEVG QRNNKQAQQK LKDLLNEVGP GYWSYRIRRV LYRMSNDAS
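
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that spacing and translate the nucleotide sequence so it can be compared with the listed protein. It assumes the gene sequence is already given in coding orientation (it begins with ATG even though the record lists the strand as "-"), and it uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. The names `clean` and `translate` are hypothetical helpers, not part of any database tooling.

```python
from itertools import product

# Amino-acid assignments in NCBI's compact notation, codon bases ordered T, C, A, G.
# Table 11 shares these assignments with the standard code (table 1).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-orientation DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 21 nucleotides of the gene sequence, copied with their original spacing
print(translate("ATGAGACTAC ACCATTGCAC T"))  # MRLHHCT
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence would check the two fields against each other.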