Gene Swit_0937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_0937 
Symbol 
ID5197966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp1046200 
End bp1047108 
Gene Length909 bp 
Protein Length302 aa 
Translation table11 
GC content67% 
IMG OID640580483 
Productprolyl aminopeptidase 
Protein accessionYP_001261442 
Protein GI148553860 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily
[TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.163263 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGGAA TCGGAAGGGC GGTACTGGCG GGCCTGATGC TGGTCGCGAT GACCGGCGAG 
GCCGTCGCCG CCGACACGGC GAAGGCGTTC ACCGCGCCGA TCGAGGGCGG CACGCTCCAT
TATGAGATGA TCGGCGGCGG CGACCAGCCG CCGCTGGTGC TAGTCAATGG CGGGCCGGGG
CTCGACCATC GTTATTTCCA TGGCAGCCCG GTCTGGGAAG GCTTGTCGAA GCGCCGCCCG
GTCGTCTTCT ACGACCAGCG CGGCATGGGA CGCACCACCT CGACCATCGC CGTCGACCGG
TTCACGGTCG ACATGATGGT CGCCGACCTG GAGGCGCTGC GGGTCAGGCT CGGCGTTCCG
AAGATCGCGC TGCTCGGCCA TAGCTGGGGC GGGCTGCTGT CGATGGCCTA TGCGACACGC
CATCCCGACC ATGTCTCCCG CCTCGTCCTG GTCGGGTCGG GCGCTCCGAA GATCGCGGCG
CACGAATATC TGTTCGACAA GCTCTATCCC GAGATCGCGG CCCGGCAGGT GCCCGACGAC
AGCCCGGCGG CGAAGATGGG ATGCAAGGCC GACAGCCTCG AGGACTATGG GCGGATGGCC
TATTACGACC AGCGCAACCA GCCGCGCCTG GCGGCCGAGG ACAACAGCGC CTTCTCGCAG
GAGGTCTGCA CCGCGGTCAT GCTCGACGCG ATGAAGCTCG ACCTCTTCCC CAGGCTCCGT
ACGCTCCATG TGCCGACGCT GGTGATCAAC GGCCGCTTCG ACGCCAATGT CGCGCCGACG
GTCGCCTATG CGATCAGCAA GGCGATCCCC GGCGCGACGC TCGACTATTT CGAGCATAGC
GGCCACCAGC CGTTCGAAGA GGAGCCCGAC CGGTTCGAAC TGGTGGTCGA GCGCTTCCTC
GATCAATAG
 
Protein sequence
MRGIGRAVLA GLMLVAMTGE AVAADTAKAF TAPIEGGTLH YEMIGGGDQP PLVLVNGGPG 
LDHRYFHGSP VWEGLSKRRP VVFYDQRGMG RTTSTIAVDR FTVDMMVADL EALRVRLGVP
KIALLGHSWG GLLSMAYATR HPDHVSRLVL VGSGAPKIAA HEYLFDKLYP EIAARQVPDD
SPAAKMGCKA DSLEDYGRMA YYDQRNQPRL AAEDNSAFSQ EVCTAVMLDA MKLDLFPRLR
TLHVPTLVIN GRFDANVAPT VAYAISKAIP GATLDYFEHS GHQPFEEEPD RFELVVERFL
DQ