Gene Shewmr4_2669 details

Gene Information

Locus tag: Shewmr4_2669
Symbol:
ID: 4253240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. MR-4
Kingdom: Bacteria
Replicon accession: NC_008321
Strand:
Start bp: 3186831
End bp: 3188483
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 53%
IMG OID: 638119304
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_734797
Protein GI: 113971004
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID: [TIGR02814] PfaD family protein
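
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record does not state either convention, but the listed numbers agree with both. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3186831, 3188483)
print(length)                  # 1653
print(protein_length(length))  # 550
```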


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAATA CCACACTCGA TAATAAAAAA CTCAGTCCTT GGCCATGGCA GGTTGATGAA 
GCCGCCATCA GTTTCGATAT CGATTCCCTT GGCAAAAAAC TTAAAGATCT CAATCAAGCC
TGTTACTTAG TCAATCATGC CGAAAAAGGC CTAGGCATAG CCCAAAGTGC CGAGGTTGTG
GGTCTTGCAG AAGCTCAACC CAGCAATGGC TTACACAAAA GCAATGGTTT GCATCCTGTG
AGCGCCTTCG CCCCCGCCCT TGGCACCCAA AGCTTGGGTG ACAGTAACTT TCGCCGTGTG
CATGGGGTGA AATACGCCTA CTACGCGGGC GCCATGGCCA ACGGCATCGC CTCGGAAGAG
TTAGTTATCG CCTTAGGTCA GGCGGGCATT TTGTGCTCCT TTGGCGCGGC AGGGTTAATT
CCATCGCGCG TTGAAGCCGC GATTAAACGC ATTCAAACGG CATTGCCCAA TGGCCCCTAC
GCCTTTAACC TCATTCACAG CCCGAGCGAG CCAGCGCTTG AACGTGGCAG TGTCGAACTC
TTCCTCAAGC ATCAAGTGCG TACGGTTGAG GCTTCGGCCT TCTTGGGCTT AACACCGCAA
ATCGTCTATT ACCGCGCCGC AGGCCTGAGC CGCGACGCCA ACGGCGAGAT TGTGATTGGC
AATAAAGTGA TTGCTAAAAT CAGCCGTACC GAGGTAGCGA CCAAGTTTAT GGAGCCCGCC
CCCGTTAAGA TGCTGCAACA ATTAGTGAAC GAAGGGCTTA TCAGCGAAGA GCAAATGTTG
ATGGCACAAT CTGTGCCCAT GGCCGATGAC ATTACCGCCG AAGCCGACTC TGGCGGCCAC
ACCGACAATC GCCCTCTGGT CACACTATTA CCGACCATTT TGGCGCTAAA AGATACCATT
CAAACCAAGT ACCAGTACAA AACGCCGATC CGTGTGGGCG CAGGTGGTGG TATCGGCACG
CCCGATGCGG CGCTGGCGAC CTTCAATATG GGCGCGGCCT ATATTGTCAC TGGCTCAATC
AACCAAGCCT GTGTTGAGGC TGGAGCCAGC GAACATACCC GTAAATTACT CGCCACCACT
GAAATGGCCG ATGTAACTAT GGCGCCCGCC GCCGATATGT TTGAAATGGG CGTTAAGTTA
CAAGTGGTTA AGCGCGGCAC TCTATTCCCA ATGCGCGCGA ACAAACTCTA TGAAATTTAC
ATCCGCTACG ACTCAATTGA GGCTATTCCG GCAGAGGAAA GACAAAAGCT GGAAGAGCAA
GTGTTTCGCG CGTCATTAGA TGAGATTTGG GCTGGCACTG TGGCGCACTT TAATGAACGC
GATCCTAAGC AAATTGAGCG CGCGCTGGAT AACCCAAAAC GCAAAATGGC GCTGATTTTC
CGTTGGTATT TGGGTCTTTC GAGCCGCTGG TCGAATACGG GCGAAGTTGG CCGTGAAATG
GATTACCAGA TTTGGGCAGG CCCCGCCCTC GGCGCCTTTA ATGCCTGGGC AAAAGGCAGT
TATTTAGATG ATTACCGCGA ACGCAATGCG GTGGATTTGA CGAAACATTT AATGCAAGGC
GCGGCCTACC AAGCACGGAT TAACCTGTTG TTATCCCAAG GGGTAAGTAT TCCAGTCAGC
CTGCAACGCT GGAAACCACT GCAGCGTTGC TAA
 
Protein sequence
MTNTTLDNKK LSPWPWQVDE AAISFDIDSL GKKLKDLNQA CYLVNHAEKG LGIAQSAEVV 
GLAEAQPSNG LHKSNGLHPV SAFAPALGTQ SLGDSNFRRV HGVKYAYYAG AMANGIASEE
LVIALGQAGI LCSFGAAGLI PSRVEAAIKR IQTALPNGPY AFNLIHSPSE PALERGSVEL
FLKHQVRTVE ASAFLGLTPQ IVYYRAAGLS RDANGEIVIG NKVIAKISRT EVATKFMEPA
PVKMLQQLVN EGLISEEQML MAQSVPMADD ITAEADSGGH TDNRPLVTLL PTILALKDTI
QTKYQYKTPI RVGAGGGIGT PDAALATFNM GAAYIVTGSI NQACVEAGAS EHTRKLLATT
EMADVTMAPA ADMFEMGVKL QVVKRGTLFP MRANKLYEIY IRYDSIEAIP AEERQKLEEQ
VFRASLDEIW AGTVAHFNER DPKQIERALD NPKRKMALIF RWYLGLSSRW SNTGEVGREM
DYQIWAGPAL GAFNAWAKGS YLDDYRERNA VDLTKHLMQG AAYQARINLL LSQGVSIPVS
LQRWKPLQRC
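
The relationship between the two sequences can be illustrated with a minimal sketch that translates the gene sequence and computes its GC percentage. It assumes the sequence is pasted in the space-separated 10-base blocks shown above, and it relies on translation table 11 assigning codons to amino acids the same way as the standard code (the two differ only in permitted start codons). The helper names are hypothetical, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
print(translate("ATGACGAATA CCACACTCGA TAATAAAAAA"))      # MTNTTLDNKK
print(translate("CTGCAACGCT GGAAACCACT GCAGCGTTGC TAA"))  # LQRWKPLQRC
```

Passing the full gene sequence to `translate` and `gc_percent` gives values to compare against the protein sequence and the GC content reported in the record.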