Gene Saro_3754 details

Gene Information

Locus tag: Saro_3754
Symbol:
ID: 5077902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 392637
End bp: 393701
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 66%
IMG OID: 640481477
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_001166139
Protein GI: 146275979
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
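
The start and end coordinates, gene length, and protein length listed above agree with one another if the coordinates are 1-based and inclusive and the gene length includes the stop codon while the protein length does not. A minimal sketch of that arithmetic, assuming this convention (the function names are illustrative and not part of any database API):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(392637, 393701)
print(length)                  # 1065
print(protein_length(length))  # 354
```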


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCGGCG CGCTCCACAC CGTCCTTGTC GAGCGGCTTG GCTGCCGCGC TCCGATCATC 
CAGACCGCCA TGGGCTGGGT GGCGGAGCCC AGCCTCGTCA TCGGCAGCAG CAATGCCGGG
GCCTTCGGGT TCCTCGGCGC GGCGGTGATG ACGCCTGATG AATGCCGCGA AAAGATCCTC
GCCGTTCGCC GTGGGACGGA CCGGCCTTTC GGCGTCAACT TCCATTCGTT CCAGCCCGGC
GCCGACCAGA TCGTGGAACT GATCCTCGCC AACCGTGACC AGGTGCGCGC GGTCAGCTTC
GGCCGCGGCC CGAATGCCAA GATGATCGGG CGCTTCCGGG ATGCGGGCAT TCTCTGCATC
CCTACCGTCG GCGCGGTGAA ACACGCGAAG AAAATGGAAG AACTGGGCGT CGACATGGTC
AGCGTCCAGG GCGGGGAAGG CGGCGGACAT ACCGGTTCGG TGCCGACGAC CGTACTGCTG
CCCCAGGTCC TCGATACAGT GAAGGTGCCC GTCATCGCCA GCGGCGGCTT TGCCGATGGC
CGGGGGCTCG TTGCCGCGCT CGCTTATGGT GCCGTCGGCA TCGCCATGGG CACCCGCTTC
CTGCTTACTC GGGAAAGCCC CGTGCCGGAC AGCGCAAAGG CGGCCTACCT CAAGGCAGGC
ACAGACCAGA TCATCGTCAC GACCAAGCTG GACGGCATCC CCCAGCGCAT GATCCGGACG
CGCCTGATGG ACCGGATCGA AAAGTCCGGA TCGCTTGCCA TGTGGCTGCG CGCCTTCGAG
GCGGGCGCCG CGATGAAGCG CCAGACGGGC GCGTCCTGGC TGCACTTCAT CAAGGCGGCG
CGCGGCATGA CCGGTCACGG GGACGTTCCG CTCAAGCAGG CGATGATGGC CGCAACCATG
CCGATGCTGA TCCAGAAGGC GGTGGTTGAT GGCGACATCG AGAACGGCGT GATGGCGACC
GGCGTCGTCG GCGGCCGGAT ATCCGAGATC CCGACCTGTC AGGAACTGGT CGATCGTATC
ATGGCCGAAG CGCACGGCCG CCTTTCCGCG CTCTGCGCAA GCTGA
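
The gene sequence above is printed in space-separated blocks of up to 10 bases. A minimal sketch of how one might strip that layout and compute GC content for comparison with the reported 66%; the function names are hypothetical, and the short string in the usage lines is a toy input, not the gene itself:

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same block layout (hypothetical, for illustration only).
toy = clean_sequence("ATGC GGCC\nAT")
print(len(toy))          # 10
print(gc_fraction(toy))  # 0.6
```

Pasting the full gene sequence into `clean_sequence` should yield a string of 1065 bases if the listing is complete.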
 
Protein sequence
MSGALHTVLV ERLGCRAPII QTAMGWVAEP SLVIGSSNAG AFGFLGAAVM TPDECREKIL 
AVRRGTDRPF GVNFHSFQPG ADQIVELILA NRDQVRAVSF GRGPNAKMIG RFRDAGILCI
PTVGAVKHAK KMEELGVDMV SVQGGEGGGH TGSVPTTVLL PQVLDTVKVP VIASGGFADG
RGLVAALAYG AVGIAMGTRF LLTRESPVPD SAKAAYLKAG TDQIIVTTKL DGIPQRMIRT
RLMDRIEKSG SLAMWLRAFE AGAAMKRQTG ASWLHFIKAA RGMTGHGDVP LKQAMMAATM
PMLIQKAVVD GDIENGVMAT GVVGGRISEI PTCQELVDRI MAEAHGRLSA LCAS