Gene Sala_2994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2994 
Symbol 
ID4082938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp3134588 
End bp3136414 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content67% 
IMG OID638011379 
Productpeptidase M24 
Protein accessionYP_618032 
Protein GI103488471 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.020362 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGCC CCGTCCACGC CGAGCGCCTC GCCCGCGTCC GTGCCGAATT GAAAGCGCGC 
GGCCTCGATG GCTTCATCGT GCCGATCAGC GACGAACATA TGAGCGAATA TGTCGGCGCC
TATGCGCAGC GCATGGCCTG GCTGACCGGC TTTGGTGGAT CGGCCGGGAC CGCGGCGGTC
CTGCCCGAAA AGGCGGCGGT GTTCGTCGAC GGCCGCTACA CCGTGCAGGT GCGCGACCAG
GTCGACGGGT CGCTCTTCGA CTATGTCGGG GTGCCGCAGT CGAGCGTCGC CGAATGGCTG
GGCAGCCATG TCAGCGCGGG GCAGAGGGTT GGTTATGACC CCTGGCTGCA CGGTATAGAC
TGGGTCCGCG GGCTGGAAAA GGCGCTGGCG GCGAAAGGTG CGAGCCTTGT CGCGGTCGAC
AAGAATCCGG TCGACGCGGC ATGGGACGAC CAACCCGCGC CGAGCAACGC GCCGGTGAGC
GTTTATGATA CGGCGCTCGC CGGACAGAGT GCGGTCGAGA AACGCGGTGT CATCGCCGAC
TGGCTGAAGG CGAAGGGGCT CGACACGACG GTGATGACCG CGCTCGATTC GATCGCCTGG
ACCTTCAATA TCCGTGGGGA GGACGTGAGC CACACGCCGG TCGGGCTGGC CTTTGCACTG
CTCCACGCCG ACGCCACCGC CGATCTGTTC ATCGCGCCCG AAAAAATCAC CGACGCGGTG
CGCGCGCATC TGGGCAACAG CGTGCGGATT CACGACCGCA GCGCCTTTGA AGGCGCGCTG
GCCGGGCTTG CGGGCAAGAA AGTCGCTGTC GATCCCGACC GCGCGGTCGC GGCGATCTTT
ACCGCGCTCG AAAACGCGGG TGTGCAGGTC GAACGGCACC GCGACCCCGC GGTGCTGCCC
AAGGCGATCA AGAATCAAGT CGAACTGAGC GGCACGCGCG CTGCGCACCT TCGCGACGGC
GTCGCGGTGT CGCGTTTCCT CAAATGGATG GAGGAGGTCG CGCCGCAGGG CGGCCTCGAC
GAGCTGGGCG CGGCGGCGAA GCTGCGCGAA TTTCGCGAGG CAGGCGGCGC GCTCAAGGAT
CTGTCGTTCG ACACCATTTC GGCGGCTGGC CCGAACGGCG CGCTGCCGCA TTACAAGGTC
GACGAAACCA CCAACCGCAG GATCGAGAGG GGCACGCTCT ATCTGGTCGA TTCGGGCGGA
CAATATGCCG ACGGCACGAC CGACATCACG CGCACGATCG CGATCGGGGC GCCCAGCGCC
GAAATGCGGC GCCGCTTCAC GCAGGTGCTG AAGGGTCATA TCGCGCTGGC CACCGCGCGC
TTTCCCAAGG GCACACGCGG CAGCCAGCTG GACATCCTCG CGCGCCAGTA TCTGTGGGCC
GACGGGGTCG ATTATGCGCA TGGCACCGGG CATGGCGTCG GCACCTATCT CGCGGTCCAC
GAAGGGCCGC AGCGGATCGC CAAGCCGGCG GGCGGACAGG CGGGGACCGA GGAGCCGCTG
CACGCGGGCA TGATCCTGTC GAACGAGCCC GGCTATTACA AGGCGGGGCA TTTCGGCATC
CGCATCGAAA ATCTGGTGAT CGTCGTGCCG CAAGAGATCG ACGGCGCCGA GGAAGAGATG
CTGGGGTTCG AGACGATCAC CTTTGCGCCG ATCGCGAGAG ATCTGGTCGA CGTGGCGCTG
CTGTCGTCCG CCGAGGCCGA CTGGCTCGAC GCCTATCATG CCGCGGTGTT CGAAAAGCTG
TCGCCGGGAA TGGACGAGGC GATGCGCGAC TGGCTTGCCG CCGCCTGCGC TCCGCTCGAC
CGCACCCCTG CCGCGCTCGC GGCCTGA
 
Protein sequence
MSSPVHAERL ARVRAELKAR GLDGFIVPIS DEHMSEYVGA YAQRMAWLTG FGGSAGTAAV 
LPEKAAVFVD GRYTVQVRDQ VDGSLFDYVG VPQSSVAEWL GSHVSAGQRV GYDPWLHGID
WVRGLEKALA AKGASLVAVD KNPVDAAWDD QPAPSNAPVS VYDTALAGQS AVEKRGVIAD
WLKAKGLDTT VMTALDSIAW TFNIRGEDVS HTPVGLAFAL LHADATADLF IAPEKITDAV
RAHLGNSVRI HDRSAFEGAL AGLAGKKVAV DPDRAVAAIF TALENAGVQV ERHRDPAVLP
KAIKNQVELS GTRAAHLRDG VAVSRFLKWM EEVAPQGGLD ELGAAAKLRE FREAGGALKD
LSFDTISAAG PNGALPHYKV DETTNRRIER GTLYLVDSGG QYADGTTDIT RTIAIGAPSA
EMRRRFTQVL KGHIALATAR FPKGTRGSQL DILARQYLWA DGVDYAHGTG HGVGTYLAVH
EGPQRIAKPA GGQAGTEEPL HAGMILSNEP GYYKAGHFGI RIENLVIVVP QEIDGAEEEM
LGFETITFAP IARDLVDVAL LSSAEADWLD AYHAAVFEKL SPGMDEAMRD WLAAACAPLD
RTPAALAA