Gene Saro_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3901 
Symbol 
ID5077385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp70559 
End bp71938 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content58% 
IMG OID640481008 
Productring hydroxylating dioxygenase, alpha subunit 
Protein accessionYP_001165670 
Protein GI146275509 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.682799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGGAT CGGCGGCACT CGTCGATAAT GCCAACGCGA GCCAGTCTCG CCGGGTGTTC 
TGGGATCAGG ACGTCTATCA GCTGGAGCTG GAGCGGATAT TCTCGCGATG CTGGCTCATG
CTCGGACATG ATTCGCTAGT TCCCAAACCG GGCGACTTCA TTACGACGTA CATGGCAGAA
GACCGTGTCA TCTTGTCGCG GCAGCCGGAC GGTTCGCTGA AGGCGTTCAT CAATTCCTGC
ACTCACCGCG GCAACCAGAT CTGTCATGCC GACAGCGGCA GCGCCAAGGC GTTCGTGTGC
AATTATCATG GTTGGGTTTT CGGTCAGGAT GGTTCGCTCG TCGATGTTCC GATGGAAGAG
CGGTGCTATC ACAGCGATCT CGACAAATCC AAGCTGGGGC TCGCGCCGAT CCGGGTCGAA
ACTTACAAGG GCTTCATCTT CGGCTGCCAT GATCCCGAAG CGCCCTCGCT TGAGGACTAT
TTGGGGGACT TCTGCTGGTA CCTCGATACG ATCTGGGACG GTCCGGACGG TGGTCTGGAA
CTGCTCGGGC CGCCGTTGAA GAGCACCCTC GCCTGCAATT GGAAAGTCCC GACCGAGAAC
TTCGTCGGCG ATGGGTATCA CGTGGGCTGG ACGCATGCCG CCGCTCTCCA GATGATCGGG
GGCGAGCTGG CTGGCCTGTC GGGCAATCGC GCCGACATGC CGTTTGACGA CCTTGGTCTG
CAATTCACCA TGCGGCATGG CCACGGGTTT GGCCTGATCG ATAACGCGGC GACTGCGATC
CACGTCAAGC GCGACGGGTA CGTCAAATAT CTCGAGGAGA CGCGGGGCGG AATTCGCGAA
AAATTCGGGC CGGAGCGCGA ACGGCTCTAT GTCGGTCACT GGAATACGTC GATCTTCCCA
AACTGTTCGT TCCTCTACGG AACCAACACC TTCAAGATCT GGCATCCGCG CGGGCCGCAT
GAAATCGAGG TCTGGACCTA TACCATGGTA CCGAAGAATG CCGACACCGA AACTAAGCGG
TCGATCCAGC GCGAAGCGAT CCGTTCATTT GGTACGGCGG GAACGCTCGA AAGCGACGAT
GGCGAAAACA TGTCGTCGGC CACCTACAAC AACAACGGTA TCATCACCCG CAAGGGGCGG
ATGAATTCGA GCATGGGCAA GGACCGCGAA GGGCCGCACC CCGTCTATCC AGGAATTGTC
GGGGTCAGCT TCATCGGCGA AACCTCGTAT CGAGGCTTTT ATCGTTTCTG GCAAGAAATG
CTCGATGCGC CAGATTGGGC CGCCATCCGG GCCAATGACG ATACCTGGGA TGCAATGTGG
ACCAACCGTA ATTTCTGGCC TGAACGTCTG TCGGCGAAGC AAGCCGAGCC GCAAGACTGA
 
Protein sequence
MNGSAALVDN ANASQSRRVF WDQDVYQLEL ERIFSRCWLM LGHDSLVPKP GDFITTYMAE 
DRVILSRQPD GSLKAFINSC THRGNQICHA DSGSAKAFVC NYHGWVFGQD GSLVDVPMEE
RCYHSDLDKS KLGLAPIRVE TYKGFIFGCH DPEAPSLEDY LGDFCWYLDT IWDGPDGGLE
LLGPPLKSTL ACNWKVPTEN FVGDGYHVGW THAAALQMIG GELAGLSGNR ADMPFDDLGL
QFTMRHGHGF GLIDNAATAI HVKRDGYVKY LEETRGGIRE KFGPERERLY VGHWNTSIFP
NCSFLYGTNT FKIWHPRGPH EIEVWTYTMV PKNADTETKR SIQREAIRSF GTAGTLESDD
GENMSSATYN NNGIITRKGR MNSSMGKDRE GPHPVYPGIV GVSFIGETSY RGFYRFWQEM
LDAPDWAAIR ANDDTWDAMW TNRNFWPERL SAKQAEPQD