Gene Sala_2006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2006 
Symbol 
ID4079943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2115059 
End bp2116153 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content65% 
IMG OID638010382 
Productproline iminopeptidase 
Protein accessionYP_617050 
Protein GI103487489 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.749517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00407135 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGTCATGG ATTTTTCGCG GCTTCAGGCG TCGAGCAAGA TCGGCGAGCA GTGGGTCTAT 
CCGCAGCCCG CCTGCCTCAA TTTCGGGTGG CTGGAGGTCG ACCGCGATCC CGCGCACCGC
CTTTACTGGG AGGAATATGG CAATCCCGCG GGCGAGCCGG TGATGGTCCT GCACGGCGGC
CCCGGCGGCG CGTGCGCGCC GGTGATGGCG CGCTTTTTCG ATCCGAAGCG ATACCGGGTG
ATCCTGTTCG ACCAGCGCGG GTGCGGCAAG AGCGAGCCCA ATGTCGCGTC GGCCGGGCCG
GCGGTCGCGC TGGCCAAAAA CACCACCGCC GACCTGATCG GCGACATCGA GAAATTGCGC
GATCATCTGG CGATTGCGGG GCCGATGCAC GTCTTTGGCG GCAGCTGGGG CAGCACGCTG
GCCATGGCCT ATGCGATCCA GCATCCCGCG CACTGCGCCA GCCTGATCCT GCGCGGCATC
TTTCTGGGCG CGGCGGAGGA TCTGCTTTAC CTCTATCAGG GCAATGCCGC GACGTGGGGA
GACGACCCGT TCGCGCTGAC CGCGCCCGGC GCCTATATCA AATATCCCGA CCAATGGGCG
GCGCTGCTCT CGGTGCTGAG CGCCGACGAG CGGCGCGATG TCATGGCGTC GTACAAGGCG
ATTTTCGATA TGGTGCCGGC GAATGCGGCG GAGAAGGAGC GGCAGCTGAA CGCCGCGCTC
ACCTGGTCGC TATGGGAAGG GGTGATTTCC AACATGATCC CCGAGACGGC CGACACGGGC
AAGTTCGGCG AGGCCGATTT CGCGCTGTGC TTCGCGCAGA TCGAGGCGCA TTATTTCGCC
AACGACCTGT TCCTGCCCGC GGGCCATTTT TTCGACCATA TCGACATACT GGCGTCGATC
CCCATCCACA TCGTCCACGG CCGTTTCGAC GAAGTCTGCC CGCTGACACA GGCATCGCGG
CTGGTCGCCG CGCTGCGCGC CGCGGGGGCG GAGCCGGTGT CCTATGTCGT CACCAATGCG
GGGCACAGCG CGATGGAGCG CGAGAATGCG CTGGCGCTGA CGGCGGTGAT GGATGGGTTG
GGGAGGATTG TATAA
 
Protein sequence
MVMDFSRLQA SSKIGEQWVY PQPACLNFGW LEVDRDPAHR LYWEEYGNPA GEPVMVLHGG 
PGGACAPVMA RFFDPKRYRV ILFDQRGCGK SEPNVASAGP AVALAKNTTA DLIGDIEKLR
DHLAIAGPMH VFGGSWGSTL AMAYAIQHPA HCASLILRGI FLGAAEDLLY LYQGNAATWG
DDPFALTAPG AYIKYPDQWA ALLSVLSADE RRDVMASYKA IFDMVPANAA EKERQLNAAL
TWSLWEGVIS NMIPETADTG KFGEADFALC FAQIEAHYFA NDLFLPAGHF FDHIDILASI
PIHIVHGRFD EVCPLTQASR LVAALRAAGA EPVSYVVTNA GHSAMERENA LALTAVMDGL
GRIV