Gene Swit_2031 details


Gene Information

Locus tag: Swit_2031
Symbol:
ID: 5200053
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingomonas wittichii RW1
Kingdom: Bacteria
Replicon accession: NC_009511
Strand:
Start bp: 2276593
End bp: 2277621
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 71%
IMG OID: 640581575
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_001262528
Protein GI: 148554946
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
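
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2276593
end_bp = 2277621

# Inclusive span: both the start and the end base are counted.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1029
print(protein_length)  # 342
```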


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAAG GACACCATGG CCCACAATCC CTCCGAAGCC GCGCCGGTCG AACCATGAGC 
CTCCGCACGC GCATCACCGA GATGTTCGGG ATCGAACACC CCATCCTGCA GGGCGGGATG
CAGTGGGTCG GGCGCGCGGA CCTCGTCTCG GCCGTCGCCA ATGCCGGCGG GCTGGGCTTC
ATCACCGCGC TGACCCAGCC CAACCCGGAG GCCCTGGCGC GGGAGATCGC GCGCTGCCGG
TCGATGACCG ACAAGCCGTT CGGGGTGAAC CTCACCATCC TGCCGACGCT GTCGCCGCCG
CCCTATGCGG AGTATCGCGC CGCGATCATC GAAAGCGGGG TGAGGATCGT CGAGACCGCC
GGCTACCGCC CGCAGGAACA TGTCGACGAC TTCAAGCGCC ACGGCATCAA GGTCATCCAC
AAATGCACCG CCGTGCGCCA CGCCCTTTCG GCCGAGCGGA TGGGCGTCGA CGCGATCTCG
ATCGACGGCT TCGAATGCGC CGGCCACCCC GGCGAGGACG ATGTCCCGGG GCTGATCCTG
ATCCCCGCGG CGGCGGACAG GGTCCGCATC CCGCTCGTCG CGTCGGGCGG CTTCGCCGAC
GGCCGGGGGC TGGTCGCGGC GCTGGCGCTG GGCGCCGAGG GGATCAACAT GGGCACCCGC
TTCCTCGCCA CGCGCGAGGC CCCGATCCAT GATAATTTCA AGGCGGCGCT GGTGGCCGGC
GACGAACGAT CGACCGAGCT GATCTTCAGG ACCTACCGCA ACACCGCCCG CGTCCGCAGG
AACGCCGTCA GCACCGAGGT GCGGCGGCTG GAGGCGCTCG GCGAACCGTT CGAGGCGGTG
GCGCCGCTGG TGAAGGGCGC GCGCGGGCGC GAGGGGCTCG AAACCGGCGC GACCGACCAT
GGCGTCTTCA CCGCCGGCCT CGCCCAGGCC CTGATCCAAG ACGTCCCGTC CGTCGCGGAG
CTGATCGACC GCATCATGCG GGAAGCCGCC GAGATCATCG GCGCGCGCCT GGGCGGCCTG
CGAAGCTGA
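
The GC content reported for this gene can be recomputed from the gene sequence. A minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown here (`gc_content` is a hypothetical helper name, not a database function):

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (the spaces and line breaks of the 10-base blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Example on the final block of the gene sequence only;
# pass the full sequence to check the value reported for the gene.
print(round(gc_content("CGAAGCTGA"), 1))
```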
 
Protein sequence
MPEGHHGPQS LRSRAGRTMS LRTRITEMFG IEHPILQGGM QWVGRADLVS AVANAGGLGF 
ITALTQPNPE ALAREIARCR SMTDKPFGVN LTILPTLSPP PYAEYRAAII ESGVRIVETA
GYRPQEHVDD FKRHGIKVIH KCTAVRHALS AERMGVDAIS IDGFECAGHP GEDDVPGLIL
IPAAADRVRI PLVASGGFAD GRGLVAALAL GAEGINMGTR FLATREAPIH DNFKAALVAG
DERSTELIFR TYRNTARVRR NAVSTEVRRL EALGEPFEAV APLVKGARGR EGLETGATDH
GVFTAGLAQA LIQDVPSVAE LIDRIMREAA EIIGARLGGL RS
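
The protein sequence can be checked against the gene sequence by translating it codon by codon. A minimal sketch, assuming the codon-to-amino-acid assignments of translation table 11 (which match the standard genetic code; the tables differ in permitted start codons, which this sketch does not model). The `translate` helper and its constants are illustrative names:

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order, paired with the
# amino-acid string used for the standard code and for table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line and final block of the gene sequence from this record.
first_line = "ATGCCTGAAG GACACCATGG CCCACAATCC CTCCGAAGCC GCGCCGGTCG AACCATGAGC"
last_block = "CGAAGCTGA"

print(translate(first_line))  # MPEGHHGPQSLRSRAGRTMS
print(translate(last_block))  # RS
```

The two outputs correspond to the first 20 residues and the last 2 residues of the protein sequence above.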