Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Swit_0937 |
Symbol | |
ID | 5197966 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 1046200 |
End bp | 1047108 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640580483 |
Product | prolyl aminopeptidase |
Protein accession | YP_001261442 |
Protein GI | 148553860 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily [TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.163263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGGAA TCGGAAGGGC GGTACTGGCG GGCCTGATGC TGGTCGCGAT GACCGGCGAG GCCGTCGCCG CCGACACGGC GAAGGCGTTC ACCGCGCCGA TCGAGGGCGG CACGCTCCAT TATGAGATGA TCGGCGGCGG CGACCAGCCG CCGCTGGTGC TAGTCAATGG CGGGCCGGGG CTCGACCATC GTTATTTCCA TGGCAGCCCG GTCTGGGAAG GCTTGTCGAA GCGCCGCCCG GTCGTCTTCT ACGACCAGCG CGGCATGGGA CGCACCACCT CGACCATCGC CGTCGACCGG TTCACGGTCG ACATGATGGT CGCCGACCTG GAGGCGCTGC GGGTCAGGCT CGGCGTTCCG AAGATCGCGC TGCTCGGCCA TAGCTGGGGC GGGCTGCTGT CGATGGCCTA TGCGACACGC CATCCCGACC ATGTCTCCCG CCTCGTCCTG GTCGGGTCGG GCGCTCCGAA GATCGCGGCG CACGAATATC TGTTCGACAA GCTCTATCCC GAGATCGCGG CCCGGCAGGT GCCCGACGAC AGCCCGGCGG CGAAGATGGG ATGCAAGGCC GACAGCCTCG AGGACTATGG GCGGATGGCC TATTACGACC AGCGCAACCA GCCGCGCCTG GCGGCCGAGG ACAACAGCGC CTTCTCGCAG GAGGTCTGCA CCGCGGTCAT GCTCGACGCG ATGAAGCTCG ACCTCTTCCC CAGGCTCCGT ACGCTCCATG TGCCGACGCT GGTGATCAAC GGCCGCTTCG ACGCCAATGT CGCGCCGACG GTCGCCTATG CGATCAGCAA GGCGATCCCC GGCGCGACGC TCGACTATTT CGAGCATAGC GGCCACCAGC CGTTCGAAGA GGAGCCCGAC CGGTTCGAAC TGGTGGTCGA GCGCTTCCTC GATCAATAG
|
Protein sequence | MRGIGRAVLA GLMLVAMTGE AVAADTAKAF TAPIEGGTLH YEMIGGGDQP PLVLVNGGPG LDHRYFHGSP VWEGLSKRRP VVFYDQRGMG RTTSTIAVDR FTVDMMVADL EALRVRLGVP KIALLGHSWG GLLSMAYATR HPDHVSRLVL VGSGAPKIAA HEYLFDKLYP EIAARQVPDD SPAAKMGCKA DSLEDYGRMA YYDQRNQPRL AAEDNSAFSQ EVCTAVMLDA MKLDLFPRLR TLHVPTLVIN GRFDANVAPT VAYAISKAIP GATLDYFEHS GHQPFEEEPD RFELVVERFL DQ
|
| |