Gene Sros_5685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSros_5685 
Symbol 
ID8668979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptosporangium roseum DSM 43021 
KingdomBacteria 
Replicon accessionNC_013595 
Strand
Start bp6218481 
End bp6219911 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content76% 
IMG OID 
Product2-nitropropane dioxygenase, NPD 
Protein accessionYP_003341176 
Protein GI271966980 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.140335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.54089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGAGG CCCGGCCGCC CCTTCTCATC CAGGGCGGGA TGGGGGTCGG CGTGTCCGGG 
TGGCGGCTGG CGCGGGCCGT CGCCCGGACC GGACAGCTCG GCGTGGTGTC CGGGACGGCA
CTGGACGTCG TGCTGGCCAG GCGGCTGCAA GGCGGAGATC CGGGCGGGCA CCTGCGCAGG
GCGCTGGCAC GCTTTCCGGC CCCCGAGGTC GCCGAACGGG TCCTGTCCCG GTATTTCGTC
CCCGGCGGCG CCGGGGACGG CCTGCCGTAC CGGCCGGTGC CCCGGCTCGG CCTGCGCTCG
CACCGGGTCC GGGACGAGCT CACGGTGGTC GCGAACTTCG CCGAGGTGTT CCTCGCGAAG
GAGGGGCACG AGGGCCCGAT CGGGATCAAC TATCTGGAGA AGATCCAGAT GGCCACGCCC
GCCGCCGTCT ACGGAGCGAT GCTCGCCGGT GCCGACTACG TGCTGATGGG GGCGGGCATC
CCCTCGGAGA TCCCGCGGCT GCTCGACGCG CTCGCCGCGC ACCGGCCGGC GCGGATATCG
GTCGCGGTGG CGGAGGCCGA CGCGGAGGAC CGCCACACCG TCGGCATCGA CCCGGTGGCG
CTGCTCGGCC GCACGCCCGG ACCGCTGGAG CGGCCCCGGC TGCTGGCCAT CGTCTCCTCG
CACGTCCTGG CCGCCTACCT CGCCCGCTCC CCGCAGACCC GTCCGGACGG GTTCGTGCTG
GAGTCGCCGG TGGCCGGCGG GCACAGTGCG CCGCCCCGGG GCAGGATGCG GCTCGACGCC
GTCGGCGAGC CGGTCTACGG CCCGCGCGAC GAGGTCGACA CCGGCAAGAT CGCCGCGCTC
GGGCTGCCGT TCTGGCTGGC GGGCGGCTAC GCGACCCCGG ACGGGCTGGT ACGGGCCGTG
CGGGCCGGGG CCGCCGGGAT CCAGCTGGGG ACGGCCTTCG CGCTGTGCCG GGAGTCGGGC
CTGGACGACA CGCTCAGGCG GCGCCTGCTC GGGCGCGCGT CAAGCGGGGG CCTGGAGGTC
CGCAACGACC CGCGCGCCTC GCCGGCGGGC TTCCCCTTCA AGATCGCTGA GCTGCCGGGA
ACCCTGTCCG GCCCGGACGT CTACGGCGAC CGCCCCCGCC TGTGCGACCT GGGCCACCTG
CGCACGCCGT ACCGCAAGGA GGACGGCGCG GTCGGCTACC GCTGCCCCGC CGAGCCGGTC
GACACGCACG TCCGCAAGGG CCGGCCCGTC GAGGACACCG TGGAGCGCCG CTGCCTGTGC
AACGGGCTGC TGTCCGCCAT CGGCCTCGGG CAGCGCCGCC CGGACGGCTA CCGGGAGCCC
CCGCTGCTCA CCCTCGGCCA GGATCTCGGG TTCCTCGACG AGCTGCCGGA GGACTACTCG
GCCGCCGACG TCGTCGACCA CATCCTCTCC GGGGTGAGGA GCGCGGGCTG A
 
Protein sequence
MSEARPPLLI QGGMGVGVSG WRLARAVART GQLGVVSGTA LDVVLARRLQ GGDPGGHLRR 
ALARFPAPEV AERVLSRYFV PGGAGDGLPY RPVPRLGLRS HRVRDELTVV ANFAEVFLAK
EGHEGPIGIN YLEKIQMATP AAVYGAMLAG ADYVLMGAGI PSEIPRLLDA LAAHRPARIS
VAVAEADAED RHTVGIDPVA LLGRTPGPLE RPRLLAIVSS HVLAAYLARS PQTRPDGFVL
ESPVAGGHSA PPRGRMRLDA VGEPVYGPRD EVDTGKIAAL GLPFWLAGGY ATPDGLVRAV
RAGAAGIQLG TAFALCRESG LDDTLRRRLL GRASSGGLEV RNDPRASPAG FPFKIAELPG
TLSGPDVYGD RPRLCDLGHL RTPYRKEDGA VGYRCPAEPV DTHVRKGRPV EDTVERRCLC
NGLLSAIGLG QRRPDGYREP PLLTLGQDLG FLDELPEDYS AADVVDHILS GVRSAG