Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sala_2006 |
Symbol | |
ID | 4079943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingopyxis alaskensis RB2256 |
Kingdom | Bacteria |
Replicon accession | NC_008048 |
Strand | - |
Start bp | 2115059 |
End bp | 2116153 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 638010382 |
Product | proline iminopeptidase |
Protein accession | YP_617050 |
Protein GI | 103487489 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.749517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.00407135 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGTCATGG ATTTTTCGCG GCTTCAGGCG TCGAGCAAGA TCGGCGAGCA GTGGGTCTAT CCGCAGCCCG CCTGCCTCAA TTTCGGGTGG CTGGAGGTCG ACCGCGATCC CGCGCACCGC CTTTACTGGG AGGAATATGG CAATCCCGCG GGCGAGCCGG TGATGGTCCT GCACGGCGGC CCCGGCGGCG CGTGCGCGCC GGTGATGGCG CGCTTTTTCG ATCCGAAGCG ATACCGGGTG ATCCTGTTCG ACCAGCGCGG GTGCGGCAAG AGCGAGCCCA ATGTCGCGTC GGCCGGGCCG GCGGTCGCGC TGGCCAAAAA CACCACCGCC GACCTGATCG GCGACATCGA GAAATTGCGC GATCATCTGG CGATTGCGGG GCCGATGCAC GTCTTTGGCG GCAGCTGGGG CAGCACGCTG GCCATGGCCT ATGCGATCCA GCATCCCGCG CACTGCGCCA GCCTGATCCT GCGCGGCATC TTTCTGGGCG CGGCGGAGGA TCTGCTTTAC CTCTATCAGG GCAATGCCGC GACGTGGGGA GACGACCCGT TCGCGCTGAC CGCGCCCGGC GCCTATATCA AATATCCCGA CCAATGGGCG GCGCTGCTCT CGGTGCTGAG CGCCGACGAG CGGCGCGATG TCATGGCGTC GTACAAGGCG ATTTTCGATA TGGTGCCGGC GAATGCGGCG GAGAAGGAGC GGCAGCTGAA CGCCGCGCTC ACCTGGTCGC TATGGGAAGG GGTGATTTCC AACATGATCC CCGAGACGGC CGACACGGGC AAGTTCGGCG AGGCCGATTT CGCGCTGTGC TTCGCGCAGA TCGAGGCGCA TTATTTCGCC AACGACCTGT TCCTGCCCGC GGGCCATTTT TTCGACCATA TCGACATACT GGCGTCGATC CCCATCCACA TCGTCCACGG CCGTTTCGAC GAAGTCTGCC CGCTGACACA GGCATCGCGG CTGGTCGCCG CGCTGCGCGC CGCGGGGGCG GAGCCGGTGT CCTATGTCGT CACCAATGCG GGGCACAGCG CGATGGAGCG CGAGAATGCG CTGGCGCTGA CGGCGGTGAT GGATGGGTTG GGGAGGATTG TATAA
|
Protein sequence | MVMDFSRLQA SSKIGEQWVY PQPACLNFGW LEVDRDPAHR LYWEEYGNPA GEPVMVLHGG PGGACAPVMA RFFDPKRYRV ILFDQRGCGK SEPNVASAGP AVALAKNTTA DLIGDIEKLR DHLAIAGPMH VFGGSWGSTL AMAYAIQHPA HCASLILRGI FLGAAEDLLY LYQGNAATWG DDPFALTAPG AYIKYPDQWA ALLSVLSADE RRDVMASYKA IFDMVPANAA EKERQLNAAL TWSLWEGVIS NMIPETADTG KFGEADFALC FAQIEAHYFA NDLFLPAGHF FDHIDILASI PIHIVHGRFD EVCPLTQASR LVAALRAAGA EPVSYVVTNA GHSAMERENA LALTAVMDGL GRIV
|
| |