Gene Saro_2269 details


Gene Information

Locus tag: Saro_2269
Symbol:
ID: 3916585
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2409090
End bp: 2410085
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 66%
IMG OID: 640445023
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_497540
Protein GI: 87200283
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
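
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length; the record's own figures fit both assumptions. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2409090, 2410085)
print(length)                     # 996
print(protein_length_aa(length))  # 331
```

Both values match the Gene Length and Protein Length fields of the record.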


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.255645
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCCTGC CCCCGCTATT CGCAAACCTG CGCCTGCCCG TCATCGCATC GCCGCTGTTC 
ATCATTTCCT GCCCGGAACT GGTCATCGCG CAGTGCAAGG CCGGCATCGT CGGCTCCTTC
CCTTCGCTCA ACGCCCGCCC GATCAGCCAG CTCGATGAAT GGCTGCACCA GATCACCGAG
GAACTGGCCG CCCACAACCG CGCAAATCCC GACCGGCCCG CTGCGCCTTT CGCGGTGAAC
CAGATCGTCC ACAAGACCAA CAACCGCCTC GACGAGGACA TGGCGCTGTG CGCCAAGTGG
CAGGTTCCCA TGCTCATCAC CTCGCTCGGC GCGCGCGAGG ACGTCTACAA TGCCGCGCAC
GGCTGGGGCG GCATCGTGCT GCATGACGTC ATCAACGACC GCTTCGCGCG CAAGGCGATC
GAAAAGGGCG CGGACGGCCT GATCCCGGTC GCGGCAGGGG CCGGCGGGCA CGCCGGCGCG
CAATCGCCCT TCGCCCTGAT GCAGGAAATC CGCGAGTGGT TCGACGGCCT CGTCGCCCTC
TCCGGCGCTA TCGCGCACGG GCAATCCATC CTTGCGGCAC AGGCACTTGG CGCCGACTTC
GCCTACATCG GCTCAGCCTG GATCGCCACC GAGGAGGCGA ACGCCAACGC GGCCTACAAG
CAGGCCATCG TCGACAGCCG CGCCGACGAC ATCGTCTATT CCAACCTCTT CACCGGGGTC
CACGGCAACT ACCTGCGCTC CTCGATCGTC AACGCCGGGC TCGATCCCGA AAACCTGCCG
GAAAGCGACC CGAGCAAGAT GAACTTCGGC TCGGGCGGCA ATACCGATGC GAAGGCGTGG
AAGGACATCT GGGGGTCGGG ACAAGGCATC GGCGCCGTGT CCGCGATCGA ACCGGTGGCG
ACGCGGGTGG ACCGGCTCGA GCGGCAGTAT CGCCAGGCCG CAGAGGCGCT TTCGGCAAAC
TCCGCACCCT TCCTGAAGGA GACCAGGTAC GTCTAG
 
Protein sequence
MPLPPLFANL RLPVIASPLF IISCPELVIA QCKAGIVGSF PSLNARPISQ LDEWLHQITE 
ELAAHNRANP DRPAAPFAVN QIVHKTNNRL DEDMALCAKW QVPMLITSLG AREDVYNAAH
GWGGIVLHDV INDRFARKAI EKGADGLIPV AAGAGGHAGA QSPFALMQEI REWFDGLVAL
SGAIAHGQSI LAAQALGADF AYIGSAWIAT EEANANAAYK QAIVDSRADD IVYSNLFTGV
HGNYLRSSIV NAGLDPENLP ESDPSKMNFG SGGNTDAKAW KDIWGSGQGI GAVSAIEPVA
TRVDRLERQY RQAAEALSAN SAPFLKETRY V
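
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It builds the codon table from the NCBI-style ordering of bases (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation, which does not matter here because the gene starts with ATG. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGCCCCTGC CCCCGCTATT CGCAAACCTG"
print(translate(first_blocks))  # MPLPPLFANL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` applies the same check to the whole record.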