Gene Saro_3861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3861 
Symbol 
ID5077472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp28261 
End bp29541 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content62% 
IMG OID640480970 
ProductRieske (2Fe-2S) domain-containing protein 
Protein accessionYP_001165632 
Protein GI146275471 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCCG AAACCATGCC CGTCGCCGCG GGGTCGGAGC CGACTGCGGT CGATTACGCA 
GTCTATCACA GCCAAGCGAT CTTCACTGCC GAACAGGAAA ACATCTTTCG CGGACAGACC
TGGTGCTACC TCGGCCTCGA AGCCGAGCTG GCCAACAGCG GCGATTTCCG TTCGACCCAT
GTCGGCAATA CTCCGGTGGT GGTGACGCGG GCAACCGATG GCACCATCCA TGCCTGGGTA
AATCGCTGCG CCCACAAGGG CGCAACCGTG TGCCGCTCGC TGCGTGGCAA CCAGGCCGAC
GGGGCGTTCG TCTGCGTCTA TCACCAGTGG GCCTACGACG CGACCGGTGC GCTTGTCGGC
GTGCCGTTCC GGCGGGGGCT GAAAGGCGTG GGCGGCTATT CGAAGGAATT CAATATGGCC
GAGCATTCGC TGGAGCGGTT GCGGGTCGAG ACATTCGGCG GCCTCGTGTT CGGCACCTTC
AACTCGACCA TCGCCCCGCT CGACGACTTT CTGGGTCCAG TGATGCGCAA ATACATTCAG
CGCGTCTTCC AGCGCCCGGT CAAGGTTCTG GGCTATGCGC GGCAGTTCAT GGCCGGCAAC
TGGAAGCTCT ATTCGGAAAA CAGTCGTGAC AGCTACCACG GCGGTCTGTT GCACCTGTTC
TATCCGACTT TCGGCATCTA CCGCCAGAGC CAGGAAAGCG CGGGTCTGGT TTCGGACGAG
GGCTACCACA CCGTCTTTAC CGTGTCTAAG CCCAAGGGCG ATGTCGACTA CGGCTCGTTC
GGTGACGAGG CCAACCGCGA GATGCAGGGT GAGGCCAAGT TGCAGGACGA GCGCCTGCTG
GCATTCCGCC CGGAGATCGC TGATGATGTC GGACTGCACA TCCAGTCGAT CTTCCCGTCT
GTCGTTGTCC AGCAGATCCA GAACACCCTC GCCACCAGGC AGATCGTGCC CCACGGGACC
GACAAGACCG AGCTGGTCTG GACCTATTTC GGCTATGCCG ACGATGACGA TGAAACGACT
CGCCACCGCC TTCGCAACCT CAACCTGGTT GGACCGTCTG GGTTGATTTC GATGGAAGAC
GGCGAAGCGG TCGAACTGTG CCAGCAGGGC ACGATCGGTG CCGAAGGCAA GCGCAGCTTC
GTCGAGATGG GCGGGGACGA TGTCCGACCG TCATACGCCC CGATGGGTAT GGATGAAAAT
TCCGTGCGCG GGTTCTGGAA GGGCTATCTC GGGCTGATGG GCAATGCCTT GGCCGATCTC
GCAGCGGAGG GCCGGGCATG A
 
Protein sequence
MNAETMPVAA GSEPTAVDYA VYHSQAIFTA EQENIFRGQT WCYLGLEAEL ANSGDFRSTH 
VGNTPVVVTR ATDGTIHAWV NRCAHKGATV CRSLRGNQAD GAFVCVYHQW AYDATGALVG
VPFRRGLKGV GGYSKEFNMA EHSLERLRVE TFGGLVFGTF NSTIAPLDDF LGPVMRKYIQ
RVFQRPVKVL GYARQFMAGN WKLYSENSRD SYHGGLLHLF YPTFGIYRQS QESAGLVSDE
GYHTVFTVSK PKGDVDYGSF GDEANREMQG EAKLQDERLL AFRPEIADDV GLHIQSIFPS
VVVQQIQNTL ATRQIVPHGT DKTELVWTYF GYADDDDETT RHRLRNLNLV GPSGLISMED
GEAVELCQQG TIGAEGKRSF VEMGGDDVRP SYAPMGMDEN SVRGFWKGYL GLMGNALADL
AAEGRA