Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2024 |
Symbol | |
ID | 5733913 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2516191 |
End bp | 2517573 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279168 |
Product | xenobiotic compound monooxygenase A subunit |
Protein accession | YP_001544795 |
Protein GI | 159898548 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCGC AACGACAGAT GAAACTTGGG GCTTTTCTGC CAGCACCTGG CCACCATGTT GCCGCGTGGC GACACCCGAA CACTCCAGCT AATGCTGGGC TTGAAATTCA ACACTATACC CAGGTGGCTC AAACCGCTGA ACGTGGCAAA TTTGATATGC TTTTCCTCTC GGATGGAGTT GGCATCCGCA CCCATTATAA AGATGAAGAT GAATTAAGCC GTTGGGGTCG GATTGTTCAG TTTGAGCCAC TGACCTTACT TTCGGCCTTA GCCATGGTTA CCCAAAAGAT TGGTTTGACG GCAACTGCTT CAACCACCTA TAACGAGCCA TTTCATATTG CCCGCAAATT TGCTTCGCTC GATTTTCTGA GCAATGGGCG AGCTGGGTGG AATGTTGTGA CCTCGGTGAC CGATGTTGAG GCCCAAAATT TCAACCTTCA ACACCAACCT GATCATGCCA CCCGTTATCG GCGGGCACGC GAATTTATGG ATGTGGTAAC AGGATTGTGG GATAGTTGGG AGGATGATGC CTTTATCTTC GACAAAGCCA CAGGCCGCTA TTTCGAACCA CAAAAACTAC ATATGTTGCA CCATCGTGGC GAATTTTTTC AGGTACGCGG GCCGCTGAAC CTTGCTCGTT CGCCCCAAGG CTACCCAGTT ATTGTGCAGG CTGGCTCATC AGAAGACGGT CAAGATTTTG CGGCTCAATG GGCCGAAGTG ATTTTTACCG CCCATCAAAC GCTTGAGCAA GCCCAAACAT TTTATCGTGG CATCAAAGGC CAAATGATTA AGCATGGACG CTCGCCTGAA CAAGCCAAGG TTATGCCTGG AGTGTTTGCA GTGGTTGGGC AAACCAGAGC CGAGGCCGAA GCCAAATATG CAATCTTACA AGAACTGGTT GATCCGGTGG TTGGTTTAGG GCTATTGACC GGATTGTTGG GTGATGTTGA TATTTCAGGC TATCCCTTGG ATGGGCCATT GCCAGAATTA CCGGAAACCC AAGGCAGCAC CAGCCGCCAA AAACTCGTCT ACGAGCAAGC CCAACGCCAA GGCCTCACGA TTCGCCAATT GTATCTCTCG GTTGCAGGCG GGCGAGGCCA TCGCTTTATT CTTGGAACCC CCAGCGAGAT CGCCAATCAA CTTGAGGATT GGTTTGTGAA CGAGGCTGCT GATGGCTTTA ACATCATGCC GCCAAGCTTA CCTGATGGCT TAAACGACTT TGTTGATTTG GTGATTCCTG AATTACAACG CCGTGGATTG TTTCGAACTG ACTACGAAGG CACAACCTTA CGTGACCATC TAGGGCTTGA TCGCCCGCTC AATCGCCCGA ACAAGAGTAC TGCCGAACGT GCCACGCTGG CGATTGCCCG AGGTGCTGAA TGA
|
Protein sequence | MKPQRQMKLG AFLPAPGHHV AAWRHPNTPA NAGLEIQHYT QVAQTAERGK FDMLFLSDGV GIRTHYKDED ELSRWGRIVQ FEPLTLLSAL AMVTQKIGLT ATASTTYNEP FHIARKFASL DFLSNGRAGW NVVTSVTDVE AQNFNLQHQP DHATRYRRAR EFMDVVTGLW DSWEDDAFIF DKATGRYFEP QKLHMLHHRG EFFQVRGPLN LARSPQGYPV IVQAGSSEDG QDFAAQWAEV IFTAHQTLEQ AQTFYRGIKG QMIKHGRSPE QAKVMPGVFA VVGQTRAEAE AKYAILQELV DPVVGLGLLT GLLGDVDISG YPLDGPLPEL PETQGSTSRQ KLVYEQAQRQ GLTIRQLYLS VAGGRGHRFI LGTPSEIANQ LEDWFVNEAA DGFNIMPPSL PDGLNDFVDL VIPELQRRGL FRTDYEGTTL RDHLGLDRPL NRPNKSTAER ATLAIARGAE
|
| |