Gene Sare_2835 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2835 
Symbol 
ID5708009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3219179 
End bp3220300 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content72% 
IMG OID641272291 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_001537661 
Protein GI159038408 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0500011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGACTG AGCTGTGCGA GCGGTTCGGC ATCGAGTACC CGTTCGTCGG CTTCAGCCCA 
TCCGAGCACG TGGTCGCCGC GATCAGCCGG GCCGGCGGTC TGGGCGTGCT GGGCTGCGTG
CGGTTCAATG ACCCGGACGA ACTCGACGCG GTACTCACCT GGCTCGACGA CCGGACCGAG
GGACGGCCGT ACGGGGTGGA CGTGGTGATG CCCAGCAGCG TGCCGGCCGA GGGCGCTCCC
GCTGATCTCG ACCGGCTGAT TCCGGCCGGG CACCGTGACT TCGTCGAGCG GACCCTGCTG
CGCCTCGGAG TGCCGCCGCT TGGTGTCGAC AATTCCCAGC GGGCCGGGGT GCTCGGCTGG
CTGCACTCCG TGGCCCGTTC GCACGTCGAG GTGGCGTTGA CCCACCCGGT TCGACTGGTC
GCCAACGCCC TCGGCCCGCC GCCACCCGAC GTGATCGCCC AGGCGCACGA GCGGGGCGTG
GTGGTGGCCG CGTTGGCCGG CCGGGCCGAC CATGCCCGAG GCCACGTGGC GAGCGGGGTC
GACCTGGTGG TGGCGCAGGG CTACGAGGCC GGCGGCCACA CGGGTGAGAT CGCCAGCATG
GTGCTGGTGC CGGAAGTGGT CGACGCGGTG GGTGCGCAGG TGCCGGTGCT CGCCGCGGGC
GGCATCGGTA GCGGCCGGCA GATCGCGGCG GCGCTCGCGC TCGGCGCGTG CGGTGTGTGG
ATGGGGTCGG TCTGGCTCGG CACCGCCGAA TACCAGAGCA GCGCCGCGTT ACGCGAGGCC
CTGCTGCGGG CCGGGTCAGC GGACACGGTA CGTAGCCGCG TCTATACCGG TAAGCCGGCC
AGACTGCTAC GAAATCGGTG GACCGACGCC TGGAGTGAGG AGGCTGCGCC CCGGCCGCTG
CCGATGCCAC TGCAGAATCT GCTGGTGGCC GAGGCACACA CCCGGCTCAT GGCTTCCGAC
GATCCGACTG TCGTCCCGAT GCCGGTCGGG CAGATCGTGG GTCGGATGAA CGAGGTGCGT
CCGGTCGCGG ATGTCCTCGC GGACCTGGCT GCCGAGGCGG ACGAGACGTT GGCCCGGCTT
GGGACGCTGT CCTGGCGGCG GCTGCCCCCG GGAAGCGGAT GA
 
Protein sequence
MRTELCERFG IEYPFVGFSP SEHVVAAISR AGGLGVLGCV RFNDPDELDA VLTWLDDRTE 
GRPYGVDVVM PSSVPAEGAP ADLDRLIPAG HRDFVERTLL RLGVPPLGVD NSQRAGVLGW
LHSVARSHVE VALTHPVRLV ANALGPPPPD VIAQAHERGV VVAALAGRAD HARGHVASGV
DLVVAQGYEA GGHTGEIASM VLVPEVVDAV GAQVPVLAAG GIGSGRQIAA ALALGACGVW
MGSVWLGTAE YQSSAALREA LLRAGSADTV RSRVYTGKPA RLLRNRWTDA WSEEAAPRPL
PMPLQNLLVA EAHTRLMASD DPTVVPMPVG QIVGRMNEVR PVADVLADLA AEADETLARL
GTLSWRRLPP GSG