Gene Saro_1176 details

Gene Information

Locus tag: Saro_1176
Symbol:
ID: 3916473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1215757
End bp: 1216803
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 71%
IMG OID: 640443912
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_496455
Protein GI: 87199198
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
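
The gene length and protein length listed above follow from the start and end coordinates. The short check below is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 1215757
end_bp = 1216803

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# Each codon is three bases; the last codon is assumed to be a stop codon.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1047
print(protein_length)  # 348
```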


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.829209
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCGCCTCG CACAGCGACT GGGCCTGCGG CATCCGCTGA TACAGGCACC GATGGCGGGG 
ACATCCACCC CCTCCCTTGC CGCTGCCGTG TGCGAGGCGG GCGCGCTGGG GTCCGTCGCG
GTGGGTGCGG TCGATGCCGG GACGGCGCGA ACGATGATCG CCGACCTGCG GGCGCGGACC
GCGCGGCCGT TCAACGTCAA TGCCTTCGTC CACCACAGGG CCCTGCGCGA CCTTGCGGCG
GAACAGGCAT GGATCGCGGC GATGGCGCCG CTGTTCGAAC GGTTCGGGGC CGCGCCACCT
GCCGCGTTGA ACGAGATCTA CCGTTCGCTG AACGACGATC CGGACATGCT GGCAGTGCTG
GTGGAGGCCG CGCCTGCGGT GGTGAGCTTC CATTTCGGAC TGCCGACGGA CGAGGCCATC
GCCGCGTTGA AGGCGCGTGG GTGCATGCTG ATGGCGAGCG CCACGTCGCT TGCCGAGGCC
GAGGCGGCGG TCGCGGCCGG CATGGATGCG GTAGTCGCAC AAGGGTTCGA GGCGGGCGGC
CACCGGGGCG TATTCGATCC CGAAGCGCCG GACGAACGGA TGCCGACGCT TGACCTGGTG
CGGTTGCTGT CGTCCCGGCT GGACGTTCCC GTAATCGCGG CGGGCGGGAT CATGGACGGG
GCGGACATAC GCCGCGCACT GGACGCCGGA GCGGATGCGG CACAACTTGG CACGGCATTC
GTGGGCTGTC CCGAAAGCGC GGCGGACGCC GGCTATCGCG CGATGCTGGC GCGGGCCAAG
GGCACGACGC TGACGGCGGC GATATCGGGG CGCCCCGCCA GGTGCCTCGA CAACGATTTC
GTCGCGTGGG CGCGCGATAC CGATGCGCGC GTGCCCGGCT ATCCGGTGAC GTACGATGCC
GGAAAGGCGC TGATCGCGGC GGCAAAGGGC GCGGGCGAAT GCGGGTTCGG CGCGCATTGG
GCCGGAACGC AGTTCGCGCG CGCGCGGCCC ATGCCCGCAG GGGAACTGGT CATGCTTCTG
GCGCAGGAGG CCGGGTTCGA TGCCTGA
 
Protein sequence
MRLAQRLGLR HPLIQAPMAG TSTPSLAAAV CEAGALGSVA VGAVDAGTAR TMIADLRART 
ARPFNVNAFV HHRALRDLAA EQAWIAAMAP LFERFGAAPP AALNEIYRSL NDDPDMLAVL
VEAAPAVVSF HFGLPTDEAI AALKARGCML MASATSLAEA EAAVAAGMDA VVAQGFEAGG
HRGVFDPEAP DERMPTLDLV RLLSSRLDVP VIAAGGIMDG ADIRRALDAG ADAAQLGTAF
VGCPESAADA GYRAMLARAK GTTLTAAISG RPARCLDNDF VAWARDTDAR VPGYPVTYDA
GKALIAAAKG AGECGFGAHW AGTQFARARP MPAGELVMLL AQEAGFDA
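
The protein sequence can be related back to the gene sequence by translating it with translation table 11, where an alternative start codon such as GTG is read as methionine at the first position. The following is a minimal sketch rather than the database's own method: the helper names `clean`, `gc_percent` and `translate_cds` are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
# Amino-acid assignments in TCAG codon order; tables 1 and 11 share these
# assignments and differ in which codons may act as start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of bases that are G or C."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence, reading a valid start codon as M."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "GTGCGCCTCG CACAGCGACT GGGCCTGCGG"
print(translate_cds(prefix))  # MRLAQRLGLR
```

Passing the full gene sequence to `gc_percent` and `translate_cds` would allow the listed GC content and protein sequence to be checked in the same way.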