Gene Saro_3505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3505 
Symbol 
ID5077654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp113750 
End bp115099 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content64% 
IMG OID640481229 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001165891 
Protein GI146275731 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.231372 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACCC TGCGTGAAGA CGTTCCGCCG CCGATGGACG GCCGCGATCC CTCGCACCTG 
CGCGGCGATG CCATTACCGG CGACCGCTAT TATTCGGCTG AATTCGCCCG GAAGGAATGG
GACGGGCTGT GGACGCGCAT CTGGCACATT GCCGGGCGCA CCGCCGAAAT TCCCGAAGCG
GGCGATTTCC TCGTCCATAC CTTCATGAAG GAATCGGTCA TTGCCGTGCG CCAGGACGAC
GGTTCAGTCC GCGCCTTCTA CAATTCCTGC GGCCATCGCG GGATGCGCAT GGTCGACCAG
TCGAGTTCGG TCGCCGCGTT CCATTGCCCC TACCACGGCT GGCGCTGGGG CATCGACGGC
GTGCTCGAAC ACGCGCAGGA CGCCGACGTC GATTTCAAGC GTGGCAACCC CTGCGGCAAG
CTGAAGCTCA AGGAACTGCG CTGCGGTACC TGGGGCGGCT TCGTCTGGTA CACCATGGCC
GAGGAAGGCC CTTCGCTCGA GGAATACCTC GCGCCGATGC CCGCGCTGTA CAAGAATTAC
CCGATGGATA CCGCGGTCCG GGTCGCGTGG TATCGCATCG AACTCAACGC CAACTGGAAG
TTCGTCACCG ACAACTTCTC GGAAAGCTAT CACACCCGGA CCGCGCATCC CCAGGTCCCG
CCGTGGATCG ACCAGGACGT CGATTCCGCC CGGCATGAGA TGTGGCCCGC CGGCCATGGG
CGCACGGTCC AGCCGATGCG GCCCTCGCTG ACCGACCGGC CCGCCGATGG CACCGAACAC
ATGTTCGCCC ACATCCTGCG CGCGTGGGAC ATCGATCCGG CAAAGTATTC CAGCTACGAG
GAATTCGCGC TCCAGGGGTG GAAGGACCTG AAGCAGGCGA AGCGCCGCCT GTGGCGCGAG
CGGGGTTATG TCCACTACGA GAACATGGAC GACGAGGAGA TCACCGACAG CCCGCACACG
GTGATCTTCC CCAATGTCAC CATCAGCTTC CTGCCCGACA ATCTCATCCT TTTCCGCAGC
GAACCGCACG CGACCGATCC CGAGAAGTGC TACTTCGACC TGTGGTGCAT GGCCTTCCCG
GTCGAGGGGC AGAGCGAGGT GGAATCGATC ATGGCCGGGG TGCGCCCTCT GCGCGAGGTG
GCGGAGTGCG AGCATCGGGT GTTCGATGGC GGGCGCGGCA TTCCCGAACT GGCCGGGCAG
ATCGTCTACC AGGACATGGA ATTGGCCGAA AACATGCAGG CCGGCATGCA TTCTCGCGGA
TATTCAGATG CCTACCTCTC GGACCAGGAG ACCCGCATCC GCTTCTTCCA CGAGGTGCTG
AACGACTGGA TCGAGGGCCG GAAGGGCTGA
 
Protein sequence
MTTLREDVPP PMDGRDPSHL RGDAITGDRY YSAEFARKEW DGLWTRIWHI AGRTAEIPEA 
GDFLVHTFMK ESVIAVRQDD GSVRAFYNSC GHRGMRMVDQ SSSVAAFHCP YHGWRWGIDG
VLEHAQDADV DFKRGNPCGK LKLKELRCGT WGGFVWYTMA EEGPSLEEYL APMPALYKNY
PMDTAVRVAW YRIELNANWK FVTDNFSESY HTRTAHPQVP PWIDQDVDSA RHEMWPAGHG
RTVQPMRPSL TDRPADGTEH MFAHILRAWD IDPAKYSSYE EFALQGWKDL KQAKRRLWRE
RGYVHYENMD DEEITDSPHT VIFPNVTISF LPDNLILFRS EPHATDPEKC YFDLWCMAFP
VEGQSEVESI MAGVRPLREV AECEHRVFDG GRGIPELAGQ IVYQDMELAE NMQAGMHSRG
YSDAYLSDQE TRIRFFHEVL NDWIEGRKG