Gene Strop_2777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2777 
Symbol 
ID5059240 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3148774 
End bp3149694 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content70% 
IMG OID640475031 
Productproline-specific peptidase 
Protein accessionYP_001159597 
Protein GI145595300 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01250] proline-specific peptidases, Bacillus coagulans-type subfamily 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.159674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000853285 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGACGACAA CTGGAGTGGA CGCCATGACA CTGGCACCGA CCGACAAGGG CATCGTGGAG 
TTCGGCGACC ATCGAACGTG GTACCGCGTG ACGGGCCGGC TGCACGAGGG TCAGCCGCCG
CTCGTCGTGC TGCACGGCGG CCCGGGCAGC ACCCACGACT ACCTGCTCAG CCTGGCCGAG
CTGAGCCACT CCGGTCGCCC GGTGGTGCAC TACGACCAGC TCGGTAACGG CGGCTCCACC
CATCTGCGGG ACCGTGGCGC CGACTTTTGG ACGGTGGAGC TGTTCCTGGC CGAGCTGGAC
AACCTCCTGC GCCGGCTCGG CGTCACCGAC GAGTACGTCC TGCTCGGGCA GTCCTGGGGT
GGGGTCCTGG CGGCGGCCCA CGCGGTGGAC CGACCCGCCG GGCTGCGCGG CCTGGTGATC
GCCAACGCGC CCGCGTCGTA CCCGCTGTGG CTGTCCGAGC TGGACGTGCT GCGGGCCGCG
TTGCCCCCCG GCGTGGACGC GACACTGCGT CGACACGAGG CCGCCGGCAC CACCGACAGC
CCCGCCTACG TGGCCGCGAT GATGGTCTTC TACCAGCGGC ACGTGTGCCG GCGTAAGCCG
TTGCCGCCGG AGCTGATGGC CACCTTCATG GAGATCAACG GTGATCCGAC CGTCTACCAC
TCCATGAACG GGCCGAGCGA GTTCTGCGTG ACCGGGACCC TGCGCGACTA CTCGCTGGTC
GACCGTCTGC CGCAGATCGA CGCGCCCACC CTGGTCATCA GCGGCGAGCA CGACGAGGTC
ACCCCGGCCG CCGTGCGCCC CTTCCACGAT CTCGTCCCCG GTGCTCGCTG GGAGATCGTC
GATGGGGCCA GTCACCTGCC TCACCTGGAG ACCCCGGAGC GGTTCACCGA AATCCTCACC
GAGTTTCTCG ACCGGCTCTG A
 
Protein sequence
MTTTGVDAMT LAPTDKGIVE FGDHRTWYRV TGRLHEGQPP LVVLHGGPGS THDYLLSLAE 
LSHSGRPVVH YDQLGNGGST HLRDRGADFW TVELFLAELD NLLRRLGVTD EYVLLGQSWG
GVLAAAHAVD RPAGLRGLVI ANAPASYPLW LSELDVLRAA LPPGVDATLR RHEAAGTTDS
PAYVAAMMVF YQRHVCRRKP LPPELMATFM EINGDPTVYH SMNGPSEFCV TGTLRDYSLV
DRLPQIDAPT LVISGEHDEV TPAAVRPFHD LVPGARWEIV DGASHLPHLE TPERFTEILT
EFLDRL