Gene Strop_2620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_2620 
Symbol 
ID5059083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp2941801 
End bp2942763 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content68% 
IMG OID640474876 
Productproline iminopeptidase 
Protein accessionYP_001159442 
Protein GI145595145 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID[TIGR01249] proline iminopeptidase, Neisseria-type subfamily 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.000526602 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.51631 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGCGTC TGTATCCCGA GATCGAACCC TTCGCAGACG GCTTGCTCGA TGTCGGGGAC 
GGCCACCTCG TCCATTGGGA GAGCTGCGGT AACCCACTCG GCAAGCCGGC ACTGGTATTG
CATGGCGGCC CCGGTTCCGG TGCCAGTGCC TCCTGGCGGC GATTCTTCGA TCCGGCCGTG
TACCGGGTGG TCCTGTTCGA CCAGCGGGGG TGCGGGCGCA GCACACCGGA CGCGGGCGAC
GTGCGAACCG ACCTGTCGAC CAACACCATG CCGCATCTGC TGGCTGACAT CGAAAAGCTG
CGCACACACC TGAACATCGA CCGGTGGTTG CTGCTCGGCG GATCGTGGGG CAGCGCGCTC
GGCCTTGGCT ATGCCCAGCG GCACCCCGAC CGGGTCACCG AGATCGTGTT GTTCAGTGTC
GTCACCAGCA CCCCGGCCGA GCATCGGTGG ATCACCCGCG ACCTTGGACG GATCTTCCCT
GAACAGTGGG ACAGGTTCCG GGATGCGGTG CCGGCCGCCG AACGCGACGG CAACCTGCCC
GCCGCCTACG CCCAGCTGCT GGCCGATCCG GACGAGACAG TGCGGGACCG AGCCGCACGC
GCCTGGTGCG CCTGGGAGGA CACACTCGTG TCGAACCTGC CCGGCAGTGG ACCCGACCCC
AGGTTCGAGG ACCCGGTGTT CCGGATGACT TTCGCCCGCC TTGTCACCCA CTACTGGGCG
CATGACGGTT GGTTCGCCGA CGGTGAGTTG ATGGCAGGTG CACACCGGCT TGCGGACGTT
CCCGGTGTGC TTGTCCACGG CAGGCTCGAC CTGGGCAGCC CGGCGGACGT CCCGTGGCAA
CTGTCCAAGG CCTGGCCCGC GGCGCGGGTG GAGCTGATTG ACGAGGCCGG TCATGGCGCC
GGACACGGCA TCGGGGACGC GGTCATCAAC GCCCTGGATC GTTTCGGCGC TTCCTGGCGG
TGA
 
Protein sequence
MVRLYPEIEP FADGLLDVGD GHLVHWESCG NPLGKPALVL HGGPGSGASA SWRRFFDPAV 
YRVVLFDQRG CGRSTPDAGD VRTDLSTNTM PHLLADIEKL RTHLNIDRWL LLGGSWGSAL
GLGYAQRHPD RVTEIVLFSV VTSTPAEHRW ITRDLGRIFP EQWDRFRDAV PAAERDGNLP
AAYAQLLADP DETVRDRAAR AWCAWEDTLV SNLPGSGPDP RFEDPVFRMT FARLVTHYWA
HDGWFADGEL MAGAHRLADV PGVLVHGRLD LGSPADVPWQ LSKAWPAARV ELIDEAGHGA
GHGIGDAVIN ALDRFGASWR