Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5232 |
Symbol | |
ID | 5737190 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 338223 |
End bp | 339638 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641282396 |
Product | hypothetical protein |
Protein accession | YP_001547987 |
Protein GI | 159901741 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0774837 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGTC TTGCCAATAC CGCCAATCAT GGCTTTCAAG CGCTCGATCC TGCTGCTGTC CAGCTGCTCG CCACCTATGT CACCGCCGCC AACCCTGCCG CCTGTCGGAT CGCTGATCCC TGCGCAGGCG AAGGCGCAGC CGCGTGGCAA TTGGCCAGCG CATGGGGCAT TCCCGCCGAG CGCCTCTATC TCAACGAACT GCATAGTGAG CGGGCGATTG CCTGCCGAGG GATTACGCCC AATACCACCA GCTGCGACAC CGTGCGCTGG CTCCGTGCCA ATCGCCATGG CCTGCAACTG GTCTATCTCA ATCCCCCCTT CGGCACGCAA AGCGCTGGCG AGGAGCGGAG TGAACTCCAG TTCTTTCGCC GCGTGATCGA GGAAGGTACT TGGCTGCAAC CGGGTGGGAT CGCGATTCTC GTGACCCCGC AGGATGTGTT TGCCAAACCT GCCGTCACCC AGCATCTCGC CCGCCATTAT GACAACTTAA CTATCATGGC TCTGCCTGCC GCCCTCCGGC GCTGGCGTGA GGCCATCGTG ATCGGCGTGC GCCGCAGCCG TGCCCGCAGT GGCCAAGCCC TCAGCGACAC GATCACCACC CTGACCACCC AACTGGCGCA GCCCTTGCCG GAACTCACCC TGCAAGCCGC GCCACGCTAC ACCATTCCCG TGGCCACCAG CTCCACCATC GTGTGGGAGG ATGCCACGAT TGGCACGCCC GACCAAGCCA CCACCGATGT GGTGATGACG GGCGGAGCCA CGAATACCCG CGCCTACCGC AGTGCCATTG ATGCCCTGCG CGTCGCCCAT CTGCGTCCCC TCGGCCCGCT CTCCGCTACC GCTGCGGCGG CACGGATTGC CACCGGGGAG ATTAACGGCG CGACCATCGT CATTGACGGT CGGCCCCATC TGATCAAAGG CTCGACCACC GAAGACCAAA CGGTCTGGGT CGAAACCAAG GAAGAAGGCG ATACCCTCAC GGTCATCACC CACCATATCA CGCGGCAAGT GCCCGTGGTC ATGGCGGTCG ATGTGGCCGA TGGCAGCGTG CGCCGCTATG AAGGCGATGC CGGATTACAA AAGCTTCTCG CTGATCCGGC GACCGCTGAG GCCTTGCTCG CCGCCATCAC CGCCGTTGCC CCACCCGTCT ATGCCTACGA TATGGATGCC CAGACCGCTG CGACCTTGGC CATGCTCAAA CGCAAAGATG GCCGCACATT GCCGGGCTAC GAGAATGGCC TGCTGCCCAT GCAGAAGCAT GTGGTGGCCG GAATTACGCG CTATTTGACC ACCCCCGACC CGCGCACGGG CACGCGTCCG AAAGGAACGC TGCTCAATGC CGAGATGGGT GCGGGGAAGT CCACCATGGG CATTGCGATT GCCCACTGGT TTCACCAACA ATCCTTGACC ACATAA
|
Protein sequence | MARLANTANH GFQALDPAAV QLLATYVTAA NPAACRIADP CAGEGAAAWQ LASAWGIPAE RLYLNELHSE RAIACRGITP NTTSCDTVRW LRANRHGLQL VYLNPPFGTQ SAGEERSELQ FFRRVIEEGT WLQPGGIAIL VTPQDVFAKP AVTQHLARHY DNLTIMALPA ALRRWREAIV IGVRRSRARS GQALSDTITT LTTQLAQPLP ELTLQAAPRY TIPVATSSTI VWEDATIGTP DQATTDVVMT GGATNTRAYR SAIDALRVAH LRPLGPLSAT AAAARIATGE INGATIVIDG RPHLIKGSTT EDQTVWVETK EEGDTLTVIT HHITRQVPVV MAVDVADGSV RRYEGDAGLQ KLLADPATAE ALLAAITAVA PPVYAYDMDA QTAATLAMLK RKDGRTLPGY ENGLLPMQKH VVAGITRYLT TPDPRTGTRP KGTLLNAEMG AGKSTMGIAI AHWFHQQSLT T
|
| |