Gene Sala_2312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2312 
Symbol 
ID4080597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2439711 
End bp2440970 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content66% 
IMG OID638010692 
Productmembrane dipeptidase 
Protein accessionYP_617354 
Protein GI103487793 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2355] Zn-dependent dipeptidase, microsomal dipeptidase homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAAAG CCTCCTTGCT CGTCAGCCTC GCCGCGCTCG CCCTTGTTTC CAGCCCCGTC 
GCCGCCCAGA CCTCGCCCGA GGCGGTGGCC GCGGCCGCGC TCAAGAAGGC GCCGGTGTTC
GACGGGCATA ATGACGTGCC CTGGGCGCTG CGCGCGCGCG TCGATAACGT CATCAATGAT
TTCGATTTCG TCGACACGAC CGATACCGCG ACCGGGGACC GGATCGCGAT GCACACCGAC
CTCACCCGGC TGCGCCGCGG GCATGTCGGC GCGCAATTCT GGTCGGTTTA CGTTCCCTCG
ACCACCAACG AGGCGAAAGC GGTGCAGCAG ACGATCGAGC AGATCGACGT GATGAAGCGG
CTGGTCGCGC GCTATCCCGC CGACCTGATG CTCGCCGACA ATTCCGCCGA GCTGGAAAAG
GCGATGAAGG CGGGCAAGGT CGCCGGGATG CTGGGGATCG AGGGCGGGCA TTCGATCGGG
TCGAGCCTGG CGGTGCTGCG CGAAATGTAT GGCATGGGCG TACGCTATAT GACGCTGACC
CACGGCAGAA ATGTGCCATG GGCCGACAGC GCGACCGACG CACCGGAGCA TGGCGGCCTC
ACCGATTTCG GGCGCCAGGT GGTCCAGGAA ATGAACCGCA TCGGCATGAT CGTCGATCTG
AGCCACGTCA GCGAGGCGAC GATGAAGGAT GCGCTCGCGG CGTCGAAGGC GCCGGTGATG
TTCAGCCATT CGGGCGTGCG CGCGATAAAC GATCATCCGC GCAATGTCCC CGACAGCGTG
CTGCCCGCGG TGAAGGCCAA TGGCGGGATC GTGATGGTGG TGTTCCTGCC GGGCTTCCTC
GACGCCGATG TCCGCGCGCA TGGCCTCGAC CGCACTGGCG TGGAGGCGCG GCTGAAGGCG
ATGTATCCGG GCGATCCCGC GGCGGTTGCG GCGGCGCTCA CGGCGTGGGA CGCTGCGAAC
CCCGCCCCGA AAACGCAGAT TGCCAGGGTC GCCGACCATA TCGACCATCT GAAACACATG
ATCGGCGTCG ACCATATCGG ACTCGGCGGC GACTATGACG GTATGGATTC GGCGCCCGTG
GGCATGGAGG ATGTCGCGGG CTATCCGGCG CTGTTCGTCG AGCTGGCGCG GCGCGGCTAT
TCGCAGGCCG AGCTGGAGAA GATTGCGAGC GGCAACATGC TGCGCGTGCT GAAGGCGGTC
GAGGCCTTTG CCGCAAGCCA GAAGGGTCAG CCGCCGGTCG AAACGCCGGT GGCGAAATAG
 
Protein sequence
MNKASLLVSL AALALVSSPV AAQTSPEAVA AAALKKAPVF DGHNDVPWAL RARVDNVIND 
FDFVDTTDTA TGDRIAMHTD LTRLRRGHVG AQFWSVYVPS TTNEAKAVQQ TIEQIDVMKR
LVARYPADLM LADNSAELEK AMKAGKVAGM LGIEGGHSIG SSLAVLREMY GMGVRYMTLT
HGRNVPWADS ATDAPEHGGL TDFGRQVVQE MNRIGMIVDL SHVSEATMKD ALAASKAPVM
FSHSGVRAIN DHPRNVPDSV LPAVKANGGI VMVVFLPGFL DADVRAHGLD RTGVEARLKA
MYPGDPAAVA AALTAWDAAN PAPKTQIARV ADHIDHLKHM IGVDHIGLGG DYDGMDSAPV
GMEDVAGYPA LFVELARRGY SQAELEKIAS GNMLRVLKAV EAFAASQKGQ PPVETPVAK