Gene Saro_2312 details

Gene Information

Locus tag: Saro_2312
Symbol:
ID: 3915657
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2452218
End bp: 2453627
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 65%
IMG OID: 640445068
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_497583
Protein GI: 87200326
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
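
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and not part of the database record.

```python
# Values copied from the Gene Information table above.
start_bp = 2452218
end_bp = 2453627
gene_length_bp = 1410
protein_length_aa = 469

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not translated,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # True
print(expected_protein_aa == protein_length_aa)   # True
```

Both checks pass for this record: the span is 1410 bp, which is 470 codons, giving 469 amino acids plus the stop codon.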


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.241055
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGCGT TCAAGGGGTT GAAGCCGATT CTCTATGGTG GCCGTGAAGT CTGGCCACTG 
GTCGAGGGCG GCAAGGGCGT GGCGGCGACG AACCACATGA GTTCCGGCGC CTGGGCGGCA
GCCGGCGGCA TCGGCACGGT CAGCGCGGTC AATGCCGACA GCTACGACGC CGAAGGCAAG
ATCGTTCCAC AGGTTTATCA CGCCCTCACG CGCAAGGAGC GCCACGAGGA GCTGATCAAG
TACGCGATCG ACGGCGCGGT CGAGCAGGTC AAGCGAGCCT ATGACATCGC CAGCGGCAAG
GGCGCGATCA ACATCAACGT GCTGTGGGAA ATGGGCGGCG CGCAGCAGGT GCTCGAGGGT
GTTCTGGAAA AGACCCGCGG CCTGGTCACC GGCGTCACCT GCGGCGCCGG CATGCCGTAC
AAGCTGTCCG AGATCGCGGC GCGGTTCAAC GTGAACTATC TGCCCATCGT GTCGTCGGGC
CGTGCATTCC GCGCGCTGTG GAAGCGCGCC TACCACAAGG TTTCGCACCT GCTTGCCGCC
GTGGTCTATG AAGACCCGTG GCTGGCGGGC GGCCACAATG GCCTGTCCAA CGCCGAAGAC
CCGCGCAAGC CGGAAGACCC CTATCCGCGC GTCAAGGCGC TGCGCGACGT GATGCGCGCC
GAAGGCGTTT CGGATGACGT TCCCATCGTC ATGGCGGGCG GCGTCTGGTT CCTGCGGGAA
TGGAACGACT GGATCGACAA TCCCGAGCTT GGGGCGATTG CCTTCCAGTT CGGCACGCGC
CCCCTGCTGA CCGAGGAAAG CCCGATCCCC CAGGGGTGGA AGGACCACCT GCGCACGCTC
GAGCCGGGCG ACGTGTTGCT GCATCGCTTC TCGCCCACGG GGTTCTACTC GTCGGCGGTG
CGTAATCCGT TCCTGCGCGC GCTCGAAGCG CGGTCGGAAC GCCAGATTCC CTATTCGCGG
GTGGAAGCCG GCGAACACAC CGCGCAACTC GACGTCGGCG TAAGGGGCAA GAACTTCTGG
GTGACGCCGA ACGACCTGGC GCGCGCGCGC GAGTGGCACG GTGCCGGTTT CGTCCACGCC
CTTCGCACGC CCGACGACAC GATGGTCTTC GTGACGCCCC AGGAACGCGA TGAAATCCAG
CAGGACCAGA AGGACTGCAT GGGCTGCCTT TCGCACTGCG GGTTCTCGTC GTGGAAGGAT
CACGACGACT ACACGACCGG GCGGCTTGCC GATCCGCGCA GCTTCTGCAT CCAGAAGACC
TTGCAGGACA TCGCGCACGG CGGCGATATC GACCAGAACC TGATGTTCGC GGGCCATGCG
GCATACCGCT TCAAGCAGGA CCCGTTCTAT TCGAACAACT TCACCCCGAC GGTGAAGCAG
TTGGTCGATC GCATCCTGAC CGGCGACTGA
 
Protein sequence
MSAFKGLKPI LYGGREVWPL VEGGKGVAAT NHMSSGAWAA AGGIGTVSAV NADSYDAEGK 
IVPQVYHALT RKERHEELIK YAIDGAVEQV KRAYDIASGK GAININVLWE MGGAQQVLEG
VLEKTRGLVT GVTCGAGMPY KLSEIAARFN VNYLPIVSSG RAFRALWKRA YHKVSHLLAA
VVYEDPWLAG GHNGLSNAED PRKPEDPYPR VKALRDVMRA EGVSDDVPIV MAGGVWFLRE
WNDWIDNPEL GAIAFQFGTR PLLTEESPIP QGWKDHLRTL EPGDVLLHRF SPTGFYSSAV
RNPFLRALEA RSERQIPYSR VEAGEHTAQL DVGVRGKNFW VTPNDLARAR EWHGAGFVHA
LRTPDDTMVF VTPQERDEIQ QDQKDCMGCL SHCGFSSWKD HDDYTTGRLA DPRSFCIQKT
LQDIAHGGDI DQNLMFAGHA AYRFKQDPFY SNNFTPTVKQ LVDRILTGD
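
The protein sequence above follows from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the sequence is read in frame from its first base and using the codon assignments of translation table 11, which for sense and stop codons are the same as the standard genetic code. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and are not part of any database tooling. Only the first and last lines of the gene sequence are used as input to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGTCTGCGT TCAAGGGGTT GAAGCCGATT CTCTATGGTG GCCGTGAAGT CTGGCCACTG"
last_line = "TTGGTCGATC GCATCCTGAC CGGCGACTGA"

print(translate(first_line))  # MSAFKGLKPILYGGREVWPL
print(translate(last_line))   # LVDRILTGD
```

The two outputs match the first 20 and last 9 residues of the listed protein sequence. The last line can be translated on its own because it begins at base 1381, which is the start of a codon. Passing the full gene sequence to `gc_fraction` would give the value that the table reports rounded as GC content.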