Gene Saro_3840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3840 
Symbol 
ID5077451 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp7948 
End bp9300 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content62% 
IMG OID640480950 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_001165612 
Protein GI146275451 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGACA GCATTGCCGA TCTGGTTGAT TCCCGCACCG GGCGCCAATC GCGCTCGATC 
TACGCGAGCG AAGACATTTA TCGGCAGGAA CTTGAGCGGA TCTTCGGGCG CTGCTGGCTG
TTTCTGGTCC ACACCAGCCA GATTCCGAAG CCGGGCGACT ATTTCCGCAC CTTCATGGGC
GAAGACGATG TGATCGTGAT CCGCCAGAAG GACGGGTCGA TCAAGGCGTT CCTCAACAGC
TGTACCCATC GCGGCAACCG GATCTGCCGC GCCGATCGCG GCAATGCGCG CGCTTTCACC
TGCAACTATC ACGGCTGGTC TTTTTCCCCG GACGGCGCGC TCTCCGGGGT GCCGCTGGAA
AACGAGGCCT ATTTCGGCGA ACTCGACCGC ACCAAGTTCG GCCTGATCCC GGTGACGAAA
GTGGCCGAGT ATAAGGGCCT GGTGTTCGGC TGCTGGGATG CCAATTCGCC CAGCCTCGAT
GACTATCTGG GCGATGCCAA GTTTTTCCTC GATGTCTGGC TGGATGCCAT GCCAGGCGGA
TCGGCACTGC TCGGCGAGAC GCAGAAGATG GTGCTGGGCA CCAACTGGAA GCTGCCAGTC
GAGAACGTCT GCGGCGATGG CTATCACCTG GGCTGGGCCC ATGCCGGCGC TATGGCGGCG
GTCCAGTCGA TGGACCTCAC CGGGCTCAGC GTCGGCAATT CCGGGGTCGA TCTCGATGGC
GGGCTGTCGG TCGCCGGCAT GAACGGGCAC ATGGTCCTGA GCGCGCTCGA CGGCGTTTCC
GGCTATGCCT TCTATCCCGA TCCCAAGCCG ATCCTCGAAT ACCTGGAGGC CAACCGCCAG
ACGGTGATCG ACCGTCTGGG CGAAGTGCGC GGCAGGCAGG TGTGGGGTGC GCAGGTCAAC
ATCACCATTT TCCCCAACCT GCAGCTGCTG CCCGGGCTCA ACTGGTTCCG GGTCTATCAT
CCCAAGGGTC CCGGCCAGAT CGAGCAGTGG ACCTGGGCCA TGGCCGAAAA CGACATGCCC
GAGGCGGTGA AAGCGCAGAT CCTCGAAAAC CAGTGCCTGA CCTTCGGCCT GGCAGGCCTG
TTCGACAACG ACGATGGCGA CAATCTGACC GCCTGCACCG AACAGTCGCG CGGCTGGCGC
ACGGCGCAGA TGGATGTCTA CACCAACATG GCGCTGGGCC GCTCGGGCAA GCGCGAGGGC
TTCCCCGGCG ATATCGCCGC CGGCTTGGTA AGCGAACACA ACCAGCGCTA TTTCTACCGC
CGCTGGCAAG AGCACATGAT GGCGGAAACT TGGGCCGAAG TGCCCACGTA CAACATCAAC
TCGTTGACCG AACAGGAAGC CGAGCATGCT TGA
 
Protein sequence
MNDSIADLVD SRTGRQSRSI YASEDIYRQE LERIFGRCWL FLVHTSQIPK PGDYFRTFMG 
EDDVIVIRQK DGSIKAFLNS CTHRGNRICR ADRGNARAFT CNYHGWSFSP DGALSGVPLE
NEAYFGELDR TKFGLIPVTK VAEYKGLVFG CWDANSPSLD DYLGDAKFFL DVWLDAMPGG
SALLGETQKM VLGTNWKLPV ENVCGDGYHL GWAHAGAMAA VQSMDLTGLS VGNSGVDLDG
GLSVAGMNGH MVLSALDGVS GYAFYPDPKP ILEYLEANRQ TVIDRLGEVR GRQVWGAQVN
ITIFPNLQLL PGLNWFRVYH PKGPGQIEQW TWAMAENDMP EAVKAQILEN QCLTFGLAGL
FDNDDGDNLT ACTEQSRGWR TAQMDVYTNM ALGRSGKREG FPGDIAAGLV SEHNQRYFYR
RWQEHMMAET WAEVPTYNIN SLTEQEAEHA