Gene Sala_2359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2359 
Symbol 
ID4080760 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2484997 
End bp2486178 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content68% 
IMG OID638010739 
Productpeptidase M19, renal dipeptidase 
Protein accessionYP_617401 
Protein GI103487840 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.160596 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCT GGTTGATCGC CCTTTTGATC GTCGTCGCGC TGGCCGCGCT CGCCTTTTTC 
ACGCTCGCGC CGGGGATGAT CGAGCGCGAC CTCAACCGGA TCGACGGCAA GCCGCTGCCG
CAGGTGACGG CGCGCGCAGA GGCGCTGCAC CAGACGCTCA CGATCGTCGA TCTGCACAGC
GACAGCCTGT TGTGGAGCCG CGATTTCCTG GATCGCGCCG AGCGCGGCCA TATGGACCTG
CCGCGGCTGA AGGACGGCCA TGTCGCGCTG CAGGTGCTCG CGAGCACGAC CAAATCGCCC
AAGGGGCAGA ATTACCACGC GAACGGCGCC GACAGCGACA ATATCACCGG CCTCGTGATC
GCGCAGCTCC AGCCGGTGCG GACGTGGACC TCGCTGCTCG AACGCTCGCT CTGGCACGCC
GAAAAGCTGC ACCGCGCGGC CGCGGCGTCG AACGGCACGC TGAAACCCGT CGCGACCACC
GCCGACCTCG ACGCGCTGCT CGCCGCGCGG CGCGGCAAGC CGCTCACCAC CGGCGCGCTG
CTCAGCGTCG AGGGGCTGCA CAATCTCGAA GGCGACATTG CCAATCTGGA CAAGCTCTAC
GCCGCGGGCT TCCGCATGGC GGGGCTCACC CATTTCTTCG ACAATGAACT CGCAGGCTCG
ATGCACGGGC TCAAGAAAGG CGGGCTCACC CCGCTGGGGC GGCAGGTCGT GACCGCGATG
GAGGCGAAGG GCATGATCGT CGACATCGCG CATTGCAGCG AGGCCTGCGT CGCCGACATA
TTGAAAATGG CGCGCCGCCC CGTCGTGTCC AGCCACGGCG GGGTGCAGGC AACGTGCAAG
GTCAACCGCA ACCTGTCGGA CGCGCAGATT CGCGGCGTCG CCGCAACCGG CGGCCTCGTC
GGCATCGGTT ACTGGGACGC CGCGGTGTGC GACACCTCGC CCGCGAGCAT CGCGCGCGCG
ATGAAGCACG TCCGCGACCT CGTCGGCATA AATCATGTCG CGCTCGGCAG CGATTATGAC
GGCGCCACCA CCGTGCGCTT CGACACCGCG CAGCTGGTGC AGGTGACGCA GGCGCTGATC
GACGCGGGCT TTTCCGACGA CGAAATCCGC GCCGCGATGG GCGGCAATGC GATCCGCGTG
CTGAAAGCGG GGCTGGTGCC CCTCACGCCG CCGGCGCCAT GA
 
Protein sequence
MRRWLIALLI VVALAALAFF TLAPGMIERD LNRIDGKPLP QVTARAEALH QTLTIVDLHS 
DSLLWSRDFL DRAERGHMDL PRLKDGHVAL QVLASTTKSP KGQNYHANGA DSDNITGLVI
AQLQPVRTWT SLLERSLWHA EKLHRAAAAS NGTLKPVATT ADLDALLAAR RGKPLTTGAL
LSVEGLHNLE GDIANLDKLY AAGFRMAGLT HFFDNELAGS MHGLKKGGLT PLGRQVVTAM
EAKGMIVDIA HCSEACVADI LKMARRPVVS SHGGVQATCK VNRNLSDAQI RGVAATGGLV
GIGYWDAAVC DTSPASIARA MKHVRDLVGI NHVALGSDYD GATTVRFDTA QLVQVTQALI
DAGFSDDEIR AAMGGNAIRV LKAGLVPLTP PAP