Gene Information |
Locus tag | Rcas_2414 |
Symbol | |
ID | 5539895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3107245 |
End bp | 3108810 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640894544 |
Product | 4-hydroxyphenylacetate 3-hydroxylase |
Protein accession | YP_001432512 |
Protein GI | 156742383 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2368] Aromatic ring hydroxylase |
TIGRFAM ID | [TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component |
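The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state this) and that the CDS ends with a stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are illustrative, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3107245
END_BP = 3108810
GENE_LENGTH_BP = 1566
PROTEIN_LENGTH_AA = 521


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length (1566 bp) and protein length (521 aa) agree with the coordinates.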
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTTG AGACTGTGGC AAAAACGACG GTACCCCTCA CCGGCGAGGA GTATCTGGAA AGTCTGCGTG ATGGACGTGA AATCTGGATC TATGGCGAGC GCGTCAAAGA CATTACCACT CACCCGGCGT TCCGCAACGC TACCCGCATG GTTGCCCGCC TCTACGATGC ACTGCACGAC GCTGAGAAGC AATCGGTATT AACCTGCCCT ACCGACACCG GCAATGGCGG TTTCACCCAC AAGTTTTTCC GCGCCTCGCG CAGCGCAGAC GACCTGGTCG GCGCGCGTGA TGCCATCGCC GAATGGGCGC GGTTGACCTA CGGATGGATG GGGCGCAGTC CTGATTACAA AGCCGCTTTT CTGGCAACGC TTGGCGCGAA TGCGGCGTTT TACTCCCCCT ACCAGGAGAA TGCGCGGCGT TGGTACCGCG AATCACAGGA GCGGGTGCTC TACTTCAACC ACGCGATTGT CAACCCGCCA ATTGATCGTA ACCGTCCGCC GGACGAAATC CGCGATGTGT ACATGCATGT CGAGCGCGAG ACCGACGCCG GATTGATCGT CAGTGGCGCA AAGGTCGTTG CTACCGGTTC GGCACTGACA CACTATAACT TCATTGCGCA CTACGGTCCG CTGCCGATCA GGAGCAAAGA GTTCGCCCTG ATCTTCATCG TGCCGATGGA TGCCCCCGGC GTGAAGTTGA TCGCCCGTCC CTCGTATGAG ATGGCGGCAG AAGTGATGGG CAGCCCATTC GATTATCCGC TTTCGAGCCG CCTCGACGAG AACGACTCGG TGATGATCTT CGATCAGGTG TTGATCCCCT GGGAGAATGT CTTCGTCTAC GGCGATGTCG AGAAGGTCAA CGCCTTCTTC CCGCTCTCCG GCTTTATTCC GCGCTTTACG TTCCACGGCT GCACGCGCAT GGCCGTCAAA CTCGACTTCA TTGCCGGTCT GTTCCTGAAG GCGGTCGAAG CGACAGGCGC GAAGGAATTC CGTGGCGTGC AGGCGCGCGT CGGCGAGGTG CTTGCCTGGC GCAACCTGTT CTGGGCAATC AGTGATGCGA TGGCGCGCAC GCCGATCCCC TGGAATGATG GCGCGGTGCT GCCCAACCTG GATTATGGAC TGGCGTATCG CGTGTTTGCA ACGGTAGCAT ACCCGCGGAT CAAGGAACTG ATCGAGAGCG ATGTCGCCAG CGCGCTGATC TATCTGAACT CGCACGCGGT CGATTTCAAG ACCCCCGAAA TCCGTGGTTA TCTCGACAAG TATCTGCGCG GATCAAATGG CTACTCGTCG CTTGATCGCG TCAAACTGAT GAAATTGCTG TGGGACGCGA TCGGCTCCGA GTTTGGTGGA CGCCACGAAC TGTACGAGCG CAACTACGCC GGCAACCACG AAAACATTCG CCTGGAGGTG CTGCTGACGG CGATGGCGAC CGGCGCCGCC GACCAGTACA AAGGGTTCGC CGATCAATGT CTCAGCGAGT ATGACCTCGA CGGCTGGACG GTTCCCGATC TGATCAACCC TGATGATGTG AACGTCATCT TACGACGGTT TGGCAACGGC AAGTAA
|
Protein sequence | MTVETVAKTT VPLTGEEYLE SLRDGREIWI YGERVKDITT HPAFRNATRM VARLYDALHD AEKQSVLTCP TDTGNGGFTH KFFRASRSAD DLVGARDAIA EWARLTYGWM GRSPDYKAAF LATLGANAAF YSPYQENARR WYRESQERVL YFNHAIVNPP IDRNRPPDEI RDVYMHVERE TDAGLIVSGA KVVATGSALT HYNFIAHYGP LPIRSKEFAL IFIVPMDAPG VKLIARPSYE MAAEVMGSPF DYPLSSRLDE NDSVMIFDQV LIPWENVFVY GDVEKVNAFF PLSGFIPRFT FHGCTRMAVK LDFIAGLFLK AVEATGAKEF RGVQARVGEV LAWRNLFWAI SDAMARTPIP WNDGAVLPNL DYGLAYRVFA TVAYPRIKEL IESDVASALI YLNSHAVDFK TPEIRGYLDK YLRGSNGYSS LDRVKLMKLL WDAIGSEFGG RHELYERNYA GNHENIRLEV LLTAMATGAA DQYKGFADQC LSEYDLDGWT VPDLINPDDV NVILRRFGNG K
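The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be cleaned, summarised, and translated is given below. It assumes the gene sequence is shown 5' to 3' on the coding strand (it starts with ATG and ends with TAA), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base T, C, A, G; then second; then third).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence in this record.
    dna = clean_sequence("ATGACCGTTG AGACTGTGGC AAAAACGACG")
    print(translate(dna))   # MTVETVAKTT, the first block of the protein sequence
    print(gc_fraction(dna))
```

Applying `gc_fraction` to the full cleaned gene sequence would give a figure to compare with the reported GC content, and `translate` would give one to compare with the protein sequence.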