Gene Saro_3306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3306 
Symbol 
ID3915953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3524917 
End bp3525945 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content67% 
IMG OID640446091 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_498575 
Protein GI87201318 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.152065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCG CTACGCTCCA GACTGCCACG CCCCGCACTT CCGCACTGAT GCGTCGCGGC 
GCCGAGTTCC TCGGCACGAG CCACGCCATA CTGTGCGGAG CGATGAGCTG GGTTTCGGAG
CGACACCTGG TCTCGGCCAT CAGCAATGCC GGCGGCTTCG GCGTGATCGC CTGCGGTGCG
ATGACGCCCG AATTGCTGGA CCGCGAGATC GCGGCGACCA AGGCCATGAC GGACAAGCCG
TTCGGCGTGA ACCTGATCAC CATGCACCCG GCCCTGTTCG ACCTGATCGC GGTCTGCGCG
AACCACAAGG TAGGCCATGT CGTGCTGGCC GGCGGCATCC CGCCAAAGGG CAGCGTCGAG
GCAATCAAGG CGTTCGGAGC CAAGGTCCTG GTGTTCGCCC CCACGCTGGC GCTGGCCAAG
AAGCTGCTGC GCTCGGGCGC GGACGCGCTG GTCATCGAAG GGTCGGAAGC GGGCGGGCAC
ATCGGGCCTG TCTCCACCTC TGTCCTGGCG CAGGAATTCC TGCCCGCGCT GGCCGAAGAA
CACGTGGTCT TCGTGGCGGG CGGCATCGGG CGCGGCGAGA TGATCGCGAG CTATCTCGAA
ATGGGTGCAT CGGGCGTCCA GCTCGGCACC CGCTTTGCCT GCGCAACGGA ATCGATCGCG
CACCCGGCGT TCAAGCAGGC GTTCTTCCGC GGCAATGCGC GCGATGCCGT GGCTTCGGTG
CAGGTCGACC CGCGTCTGCC GGTGATCCCG GTCCGCGCAC TGAAGAACAA GGGAACCGAA
GAGTTCACCG CCAAGCAGGT CGAAGTGGCC AAGATGCTGG ACGAGGGCAA GGTCGACATG
GCGGCGGCGC AGCTCGAGAT CGAGCACTTC TGGGCAGGCG CGCTGCGCCG CGCGGTGATC
GACGGCGATG TGGAACGCGG TTCGGTCATG GCCGGACAGT CGGTCGGCAT GGTGACCCGG
GAAGAGCCCG TGGCTGACAT CATCGCGCAG CTCATGGCGG AAAGCGAAAC GGCCCTGACG
CGCCGCTGA
 
Protein sequence
MTTATLQTAT PRTSALMRRG AEFLGTSHAI LCGAMSWVSE RHLVSAISNA GGFGVIACGA 
MTPELLDREI AATKAMTDKP FGVNLITMHP ALFDLIAVCA NHKVGHVVLA GGIPPKGSVE
AIKAFGAKVL VFAPTLALAK KLLRSGADAL VIEGSEAGGH IGPVSTSVLA QEFLPALAEE
HVVFVAGGIG RGEMIASYLE MGASGVQLGT RFACATESIA HPAFKQAFFR GNARDAVASV
QVDPRLPVIP VRALKNKGTE EFTAKQVEVA KMLDEGKVDM AAAQLEIEHF WAGALRRAVI
DGDVERGSVM AGQSVGMVTR EEPVADIIAQ LMAESETALT RR