Gene Sala_1638 details


Gene Information

Locus tag: Sala_1638
Symbol:
ID: 4080716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sphingopyxis alaskensis RB2256
Kingdom: Bacteria
Replicon accession: NC_008048
Strand:
Start bp: 1720378
End bp: 1721358
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 65%
IMG OID: 638010011
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_616684
Protein GI: 103487123
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID:
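
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene ends in a single stop codon that is not counted in the protein length; the record does not state either, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1720378
END_BP = 1721358
GENE_LENGTH_BP = 981
PROTEIN_LENGTH_AA = 326


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 981
print(coding_codons(GENE_LENGTH_BP))   # 326
```

Under those assumptions both values agree with the listed gene length (981 bp) and protein length (326 aa).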


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.397206
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.560122
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTCA AGACGCGGAT TACGGAGATG CTGGGGATTG CGCATCCGAT CGTCCAGGGG 
GGGATGCAGA GCGTGGGCTA TGCCGAACTG GCGAGTGCGG TGTCGAACGC GGGCGGGCTT
GGCATATTGA CCGCGCTGAC GCAGCCGGAC CCTGGGGCAT TGCGCGCCGA GATCGAGCGC
TGCCGCGCGA TGACCGACAA GCCGTTCGGC GTGAACCTGA CTGTATTTCC GACGATCAAC
GCCCCCGACT ACAAGGCCTA TGCGCAGGCG ATCATCGACG GCGGGGTCAA GATCGTCGAG
ACCGCGGGCA CGCAGGCGGT GCGCGAGATA TGGGAGATGC TGAAGCCGCA CGGGGTCACC
ATCCTCCACA AATGCACCGC GGTGCGCCAC GCGCTGTCGG CCGAGCGCGC GGGCTGCGAC
ATCATTTCGA TCGACGGCTT CGAATGCGCG GGCCACCCCG GCGAGGACGA TGTTCCCGGC
CTGATCCTGA TCCCGGCCGC CGCCGACAAG GTGAAGATCC CGATGCTCGC CTCGGGCGGC
TTCGGCGACG GGCGGGGGCT CGTCGCGGCG CTGTCGCTCG GCGCCGAAGG CATCAACATG
GGCACACGCT TCTGCGCGAC GGTCGAGGCG CCGATCCACG ACAATGTCAA ACAGGCCTAT
ATCGACAATG ACGAGCGCGG CAGCTTCCTG ATCTTCCGCA GCCTGAAAAA CACCGCGCGG
GTCGGCAAAA ACGCGGTCAG CGAGGAGGTC GTGCGCCGCC TTTCGGTTCC CGGCGCCACC
TTCGCCGACG TGGCCGAACT GGTCAACGGC AAGGCAGGTC GCGAACTGCT CGAAACCGGC
GACCTTTCCA GGGGCGTGTT CTGGGCCGGA ATGGTCCAGG GGCTCATCCA CGACATCCCA
ACATGCCAGC AACTCGTCGA ACGCATCATC AAGGAAGCAC AAGATATCCT CGACCAAAGA
CTCGCCGGGT TCAGAAGGTA G
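
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before it can be analysed. The following is a minimal sketch of how the blocks might be cleaned and a GC fraction computed; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above, used as sample input.
first_line = "ATGGCTTTCA AGACGCGGAT TACGGAGATG CTGGGGATTG CGCATCCGAT CGTCCAGGGG"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(round(gc_fraction(seq), 3))    # GC fraction of this first line only
```

Passing the complete sequence through the same two functions gives a value that can be compared with the GC content field of the record.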
 
Protein sequence
MAFKTRITEM LGIAHPIVQG GMQSVGYAEL ASAVSNAGGL GILTALTQPD PGALRAEIER 
CRAMTDKPFG VNLTVFPTIN APDYKAYAQA IIDGGVKIVE TAGTQAVREI WEMLKPHGVT
ILHKCTAVRH ALSAERAGCD IISIDGFECA GHPGEDDVPG LILIPAAADK VKIPMLASGG
FGDGRGLVAA LSLGAEGINM GTRFCATVEA PIHDNVKQAY IDNDERGSFL IFRSLKNTAR
VGKNAVSEEV VRRLSVPGAT FADVAELVNG KAGRELLETG DLSRGVFWAG MVQGLIHDIP
TCQQLVERII KEAQDILDQR LAGFRR
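
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds a codon table from the NCBI-style amino-acid string for the standard code; translation table 11 assigns the same amino acids to sense codons and differs only in the start codons it permits, which does not matter here because the gene begins with ATG. The `translate` function is an illustrative helper, not part of any database tooling, and it raises `KeyError` on ambiguous bases such as N.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGGCTTTCA AGACGCGGAT TACGGAGATG"))  # MAFKTRITEM
print(translate("CTCGCCGGGT TCAGAAGGTA G"))           # LAGFRR
```

The two outputs match the first ten and last six residues of the protein sequence listed above.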