Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2788 |
Symbol | |
ID | 5734669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3545509 |
End bp | 3546657 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641279931 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_001545554 |
Protein GI | 159899307 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCC TATGGTTTAT TCCAACCCAC GGCGATGGGC GCTATCTTGC CTCGACCGTG GCCAGCCGCT CGACAACGTT TCCCTATTTG CGCCAAATTG CCCAGGCAGT CGATGAATTG GGCTTTACTG GGGCATTGCT ACCAACTGGC CATTATTGCG AAGATGCCTG GGTCTTGGCC TCAGCACTAT CTGCGGTCAC GCGTCAGATG AAATTTTTGG TGGCAATTCG CCCAGGCCTG ATGTTGCCTG GCCCCGCCGT GCGAATGGCT TCGACTTTCG ACCGCATCTC CAACGGACGG TTGTTGATCA ATGTGGTGGC TGGTGGCGAT AGCGCCGAAG TTGCCAGCGA TGGCTTACAT CTCGATCATG ATCAACGTTA TGCGCTGACT GACGAATTTT TGGATGTTTG GCGGCGCGTG TTGGCTGGCG AACATGTAAC TCTCGAAGGC CAGCATATTC ATGTGACCGA TGGCAAATTA ATGTTACCGC CGTTGCAACA ACCGCATCCG CCGCTCTATT TTGGTGGCTC CTCGCCAGCA GCGCTCGAAG TTGCCGCCAA ACATGTCGAT GTCTACCTGA CTTGGGGTGA ACCGCCAGCC CAAGTCGCCG AGAAAATTAC CCAAGTTCGG GCTTTGGCCG CCAAGCATGG CCGCACAATT CGTTTCGGCA TGCGCATGCA TGTGATCGTG CGCGAAACCA ACCAAGCCGC GTGGGATGCC GCCAACGATC TGATTCGATA TGTTGATGAT GCGGCAATTG CCAATGCGCA AGCGGCTTTT GCCAAGATGG ATTCAGTCGG CCAAAAGCGT ATGCAAGCCT TGCACAATGG CAATCGCAAC CAGCTCGAAG TTAGCCCAAA TTTGTGGGCT GGAGTCGGCT TGGTACGTGG TGGTGCTGGT ACAGCCTTGG TTGGCGATGC CGGAACCGTC GCCGAACGCA TGCTCGAATA TAACGAGCTA GGCATCGATA CCTTTGTGCT TTCAGGTTAT CCTCACCTGG AAGAAGCTTA TCGCGTGGCT GAATTGTTGT TTCCGCGCTT GCCGTTGGGT AGCAAACCAA CCGCGCCTAC CAACCAAACT GGGATCTTTG GCGAGTTGCT GCAGTTGCAC GAATTTTCAC GGCGCGAGCC AGCTCCAACT GGCCAGTAG
|
Protein sequence | MEILWFIPTH GDGRYLASTV ASRSTTFPYL RQIAQAVDEL GFTGALLPTG HYCEDAWVLA SALSAVTRQM KFLVAIRPGL MLPGPAVRMA STFDRISNGR LLINVVAGGD SAEVASDGLH LDHDQRYALT DEFLDVWRRV LAGEHVTLEG QHIHVTDGKL MLPPLQQPHP PLYFGGSSPA ALEVAAKHVD VYLTWGEPPA QVAEKITQVR ALAAKHGRTI RFGMRMHVIV RETNQAAWDA ANDLIRYVDD AAIANAQAAF AKMDSVGQKR MQALHNGNRN QLEVSPNLWA GVGLVRGGAG TALVGDAGTV AERMLEYNEL GIDTFVLSGY PHLEEAYRVA ELLFPRLPLG SKPTAPTNQT GIFGELLQLH EFSRREPAPT GQ
|
| |