Gene Rcas_2414 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2414 
Symbol 
ID5539895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3107245 
End bp3108810 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content59% 
IMG OID640894544 
Product4-hydroxyphenylacetate 3-hydroxylase 
Protein accessionYP_001432512 
Protein GI156742383 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2368] Aromatic ring hydroxylase 
TIGRFAM ID[TIGR02310] 4-hydroxyphenylacetate 3-monooxygenase, oxygenase component 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTG AGACTGTGGC AAAAACGACG GTACCCCTCA CCGGCGAGGA GTATCTGGAA 
AGTCTGCGTG ATGGACGTGA AATCTGGATC TATGGCGAGC GCGTCAAAGA CATTACCACT
CACCCGGCGT TCCGCAACGC TACCCGCATG GTTGCCCGCC TCTACGATGC ACTGCACGAC
GCTGAGAAGC AATCGGTATT AACCTGCCCT ACCGACACCG GCAATGGCGG TTTCACCCAC
AAGTTTTTCC GCGCCTCGCG CAGCGCAGAC GACCTGGTCG GCGCGCGTGA TGCCATCGCC
GAATGGGCGC GGTTGACCTA CGGATGGATG GGGCGCAGTC CTGATTACAA AGCCGCTTTT
CTGGCAACGC TTGGCGCGAA TGCGGCGTTT TACTCCCCCT ACCAGGAGAA TGCGCGGCGT
TGGTACCGCG AATCACAGGA GCGGGTGCTC TACTTCAACC ACGCGATTGT CAACCCGCCA
ATTGATCGTA ACCGTCCGCC GGACGAAATC CGCGATGTGT ACATGCATGT CGAGCGCGAG
ACCGACGCCG GATTGATCGT CAGTGGCGCA AAGGTCGTTG CTACCGGTTC GGCACTGACA
CACTATAACT TCATTGCGCA CTACGGTCCG CTGCCGATCA GGAGCAAAGA GTTCGCCCTG
ATCTTCATCG TGCCGATGGA TGCCCCCGGC GTGAAGTTGA TCGCCCGTCC CTCGTATGAG
ATGGCGGCAG AAGTGATGGG CAGCCCATTC GATTATCCGC TTTCGAGCCG CCTCGACGAG
AACGACTCGG TGATGATCTT CGATCAGGTG TTGATCCCCT GGGAGAATGT CTTCGTCTAC
GGCGATGTCG AGAAGGTCAA CGCCTTCTTC CCGCTCTCCG GCTTTATTCC GCGCTTTACG
TTCCACGGCT GCACGCGCAT GGCCGTCAAA CTCGACTTCA TTGCCGGTCT GTTCCTGAAG
GCGGTCGAAG CGACAGGCGC GAAGGAATTC CGTGGCGTGC AGGCGCGCGT CGGCGAGGTG
CTTGCCTGGC GCAACCTGTT CTGGGCAATC AGTGATGCGA TGGCGCGCAC GCCGATCCCC
TGGAATGATG GCGCGGTGCT GCCCAACCTG GATTATGGAC TGGCGTATCG CGTGTTTGCA
ACGGTAGCAT ACCCGCGGAT CAAGGAACTG ATCGAGAGCG ATGTCGCCAG CGCGCTGATC
TATCTGAACT CGCACGCGGT CGATTTCAAG ACCCCCGAAA TCCGTGGTTA TCTCGACAAG
TATCTGCGCG GATCAAATGG CTACTCGTCG CTTGATCGCG TCAAACTGAT GAAATTGCTG
TGGGACGCGA TCGGCTCCGA GTTTGGTGGA CGCCACGAAC TGTACGAGCG CAACTACGCC
GGCAACCACG AAAACATTCG CCTGGAGGTG CTGCTGACGG CGATGGCGAC CGGCGCCGCC
GACCAGTACA AAGGGTTCGC CGATCAATGT CTCAGCGAGT ATGACCTCGA CGGCTGGACG
GTTCCCGATC TGATCAACCC TGATGATGTG AACGTCATCT TACGACGGTT TGGCAACGGC
AAGTAA
 
Protein sequence
MTVETVAKTT VPLTGEEYLE SLRDGREIWI YGERVKDITT HPAFRNATRM VARLYDALHD 
AEKQSVLTCP TDTGNGGFTH KFFRASRSAD DLVGARDAIA EWARLTYGWM GRSPDYKAAF
LATLGANAAF YSPYQENARR WYRESQERVL YFNHAIVNPP IDRNRPPDEI RDVYMHVERE
TDAGLIVSGA KVVATGSALT HYNFIAHYGP LPIRSKEFAL IFIVPMDAPG VKLIARPSYE
MAAEVMGSPF DYPLSSRLDE NDSVMIFDQV LIPWENVFVY GDVEKVNAFF PLSGFIPRFT
FHGCTRMAVK LDFIAGLFLK AVEATGAKEF RGVQARVGEV LAWRNLFWAI SDAMARTPIP
WNDGAVLPNL DYGLAYRVFA TVAYPRIKEL IESDVASALI YLNSHAVDFK TPEIRGYLDK
YLRGSNGYSS LDRVKLMKLL WDAIGSEFGG RHELYERNYA GNHENIRLEV LLTAMATGAA
DQYKGFADQC LSEYDLDGWT VPDLINPDDV NVILRRFGNG K