Gene Smed_5538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_5538 
Symbol 
ID5319840 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp502456 
End bp503550 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content62% 
IMG OID640777289 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_001314221 
Protein GI150377626 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.485511 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0535988 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAACGT GGCATGACCG CCGAATCCTC GACCTTCTGG GAGTGGAGAT TCCCGTCATT 
CAGGCGCCGA TGGCCGGAGC GACGACGGCA GAAATGGTGA TTGCCGCTTC TGAAGCTGGC
GGCTTGGGTT CGTTGCCGAG CGCGCAATAT TCGGTCCACC AATTGCACGA GGCGCTCTCG
CAGATTACTG CGAGAACAAC CAGGTCAATC AATGTAAATT TCTTCAGCCA CGTGAAACCT
GATGCTGATC CTGCTGGTCA GATGAGGTGG CGGGCACTCT TGGCGCCGTA TTTCGTCGAA
CTTGGCCTTG ACCCCGCTGC CCCGATCAGC GGCCCCGGAC GTGCGCCCTT CGACAACGAG
TTCTGCGAGG TCGTCGAAGA GTTCCGCCCC AAGGTGGTGA GCTTCCATTT CGGTCTTCCC
GATCGAAGAC TCGTTGATAG GGTAAAGGCC GCGGGGGCCA AAGTGCTGTC GTCCGCGACG
ACCGTCGCCG AGGCCGTCTG GCTCGAGGCA CATGGTGTTG ATGCCGTGAT CGCAATGGGT
TTCGAGGCTG GCGGGCATCG CGGAAACTTT CTTACGCAGG ACATGACAAC CCAAGTGGGA
ACGATGGCGC TCATTCCGCA GGTCGTGGAC GCGGTTAAGG TTCCGGTCAT TGCTGTCGGG
GGTATCGCAG ATGGCCGCGG AGTTGCGGCG GCGTTGATGC TTGGAGCGTC AGCGGTGCAG
ATCGGTTCTG CTTACCTTCT AACTCCAGAG GCCAAAATTC CGGATCTGCA CGCCGATGCT
CTGGGTCGTG CCGGCGACGC CAGTACCGCC ATCACCAATG TCTTTACAGG AAGGCCCGCG
AGAGGCGTCG TAAACCGACT GATGCGAGAA CTAGGTCCGC TTTCGGACGT GGCGCCCGCC
TTTCCGACCG CCGGAGTGGC GCTCGCCGCG ATCCGCGCGA GGGCGGAGGA GGAGGGACGC
GATGACTTCA CCAACCTCTG GGCGGGGCAG GCCCTTGGTT TGGCGAGGCG GCTTCCCTCA
GCGGAACTCA CCGTGAAGTT GTTCGAGGAC GCGATGGCAG CCCTGGGGGC GGGTTCGGCG
ATCAGGCGAC TGTAG
 
Protein sequence
MRTWHDRRIL DLLGVEIPVI QAPMAGATTA EMVIAASEAG GLGSLPSAQY SVHQLHEALS 
QITARTTRSI NVNFFSHVKP DADPAGQMRW RALLAPYFVE LGLDPAAPIS GPGRAPFDNE
FCEVVEEFRP KVVSFHFGLP DRRLVDRVKA AGAKVLSSAT TVAEAVWLEA HGVDAVIAMG
FEAGGHRGNF LTQDMTTQVG TMALIPQVVD AVKVPVIAVG GIADGRGVAA ALMLGASAVQ
IGSAYLLTPE AKIPDLHADA LGRAGDASTA ITNVFTGRPA RGVVNRLMRE LGPLSDVAPA
FPTAGVALAA IRARAEEEGR DDFTNLWAGQ ALGLARRLPS AELTVKLFED AMAALGAGSA
IRRL