Gene Sare_0075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0075 
Symbol 
ID5707209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp86607 
End bp89450 
Gene Length2844 bp 
Protein Length947 aa 
Translation table11 
GC content66% 
IMG OID641269601 
Productpeptidase M28 
Protein accessionYP_001535001 
Protein GI159035748 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG2234] Predicted aminopeptidases
[COG3227] Zinc metalloprotease (elastase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.11414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.170085 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCGTGA CGGTGCAGGC TGCCGGGGCG ATCATCGCCC TCGGGTACAC GTCCGCCAGC 
GAGCTGCCGG AGCTGGACGC GCTCGAACGC CTGGCGATCA GCACTGCCGC CGCCGATCAG
TATGTCTCCG GCACCTCCAC CGGGTTTGTG AAAGGTGAGG ACGAGAAGTA CAGCCGTCAG
CAGGTGACCC CCGCCTCGCG CGGCCTGACG TACGTCACCT ACTCCCGGAC TTACAAGGAC
CTACCGGTAT TCGTCGGGGG TGAGGCCGTG GTGGTGACCG ACAGCAAGAG CACCGTCCGC
AGCAGCACCG CTGGCTCGGG GCCGCTGCAG GTGTCCACGT CGGCGAAAGT GAGCGCCAAG
AAGGCCAAGG CAGTGGCGTT GGGCAAGCAC CAGGGCACCG TACGAGGCGA GCCGACGCTG
GGTGTGATCG CCGAAGGTGA TGGCCGGCTC GTCTGGGAGG TCGTCGTGAG CGGCGACGGC
GAGCACGGGC CGAGTGTGGC GCACGCATTC GTGGATGGGC AGACCGGCGC TTACGTTGGC
TCCTGGGACG AGGTTGCTCG AGGCACGGGC AACGGACACT ACAACGAGCA GGTTCACCTC
GACACCACCG GCAGCGCCGG TCGCTTCGAA TTGCGTGATC CGACGCGCGG CAATATGCGA
ACTCTGAACG ACGCCGCGTC CGGCACTCCA TTCACCGACG CCGACGACGT GTGGGGCAAC
GGCGTCGGCT CTGACCTGGT CACCGGAGCG GTCGACACCC AGTATGCGGG CGCGGCGGTG
TGGGACATGC TGGAGGAGAA GTTCTCGCGC AATGGCATCG ATGACGCGGG TAGGACCGCG
ACCATGTTCG TCGGCCTTCC TGCCGCCAAT GCCTACTACG CCTGCGCCGG TACCGGCGAT
GCGAGCCGCG ACCAGACCAA GTATGGCCGC ACCACCGACC GTGCCCGACA GGTCAACTCA
GTAGACGTGG TTGCGCACGA GCTGGGCCAC GGCATCTTCT GCCACACTCC CGGCGGCAGC
CGTGGCATCA CGAACGAGAC CGGCGGCCTC AACGAGGGTA CGGGCGACAT TTTCGGCGCG
CTCGCCGAGC ATTTCGTTGC CAATCCGAAC GACCCAGCCG ACTACCTGGT CGGCGAGGAA
GTCAACCTGT CCGGGCGTGG CCCAATTCGT ACGATGTACG ATCCATCCAA GAACGGCGAC
CCGAACTGTT GGTCAACCGA CATCCCGAGA ACTCGGGTGC ACTCGGCGGC TGGACCGCTC
AACCACTGGT TCTACCTCGC GGCTGAGGGC TCCAAGCCCG CTGGCAAGCC GGCCAGCCCC
ACCTGCAACG GCACTGACGT TACTGGCATC GGGCTGTGGC AGGCCGGTGA GATCTACTAC
CACGCGCTGC TGCGCAAGAC CTCGGGCTGG ACGTACACGC AGGCACGGAA GGCGACCCTG
GACGCCACTC GAGAGCTGTA CCCGAACAGC TGCGCCGAAT TCAACGTGAT CAAGGCAGCG
TGGAACGCGG TGAGCGTCCC GGCGCAGGGT GACCCCACCT GCACGACCGG CACGCCAACG
CCGACGGCCT CGCCGTCAGC GCCAGGTCCC TCATCGTCGC CGACATCGCC ACCCGGTGGA
GAGCCGGCGG CAGCACCGGA CATTGATGGC GCCAAGGTCG AGGCACACCT CGAGGAACTC
GGCCGAATCG CCGCCGCGAA CGGCGGTAAC CGGGCACACG GCACCCCGGG GTACCGGGCT
TCGCTCGACT ATGTGAAGGG CGAACTCGAC GCCGCGGGCT ACAACACCCG GATCCAGCAG
TTCAACTCCG GCGGCAAACC CGGGTTCAAC CTGATCGCCG ACCTGCCCGA CCGAGAAGAC
CACGACAAGG TGGTCATGCT CGGCGCGCAC CTGGACAGCG TCGACATTGG GCCTGGCATC
AACGACAACG GCAGTGGCTC TGCTGGCATC CTCGAGGTCG CTCTGACCTA CGCCGCCAGC
GGCGCGAAAG GCGACAAGGC GATTCGCTTC GGCTGGTGGG GGGCAGAGGA GGACGGCCTG
GTCGGGTCCA AGGCGTACGT GACGTCGTTG TCAGCCGCGG AAAGGGAATC GATCACCGCA
TACCTGAACT TCGACATGAT CGGTTCGCCG AATCCGGGGT ACTTCGTCTA CAACGACGAC
GCCAAGGGTG ACTTCATCAC CGAGGCGCTC GAGGAGGGCT TCGCTGCCGA GGATGTTCCG
TCCGAGGGCG TCAGCCTTCG CGGCCGGTCG GATCATGCCC CCTTCATGGC GGTGGGCATC
CCCAGCGGCG GCACCGCCAC GCTGAGCCTC GTGCCGGTGA TGAGCCAGGC CCAGGCCGCC
AAGTGGAACG GCAAGGCCGG GCAGCCGTTT GACCCCTGCT ACCACAGAAA CTGCGACACG
GTGGAGAACA TCAGCACCGC TGCCCTGGAC ACGCACACGG ATGTGGCCGC GTACGCCGCG
TGGAAGTTGA CCGGTGTGCA CGCCGCTGGC GGATCAGCGG GCTCCACGCA CGTCACCAAC
AACACCCACT TCCCCATCCG TGACCGGTCG GTCGTCGAGT CCCCCATTAC GGTGCAGCGC
GACGGACCTG CGCAGGCCGT TCGTGAGGTG AAAGTCGATA TCGTGCACTC CTACCGCGGC
AACCTGGAGA TCCATCTGCT GGCACCCGAC GGCACCGAAT ACCTCATCAA GCGTCCGAGC
CGCCTCGACA GGGCGGACGA CGTCAAGCTA ACAAAGCCGA TCGACTCCTC GGCGGAGAAA
ACCGATGGAA CCTGGAAACT GCGGGTACGT GACCTGCACT CTGGCAACAT TGGCACGCTC
CGCTCCTGGA GCCTGATTTT CTAG
 
Protein sequence
MIVTVQAAGA IIALGYTSAS ELPELDALER LAISTAAADQ YVSGTSTGFV KGEDEKYSRQ 
QVTPASRGLT YVTYSRTYKD LPVFVGGEAV VVTDSKSTVR SSTAGSGPLQ VSTSAKVSAK
KAKAVALGKH QGTVRGEPTL GVIAEGDGRL VWEVVVSGDG EHGPSVAHAF VDGQTGAYVG
SWDEVARGTG NGHYNEQVHL DTTGSAGRFE LRDPTRGNMR TLNDAASGTP FTDADDVWGN
GVGSDLVTGA VDTQYAGAAV WDMLEEKFSR NGIDDAGRTA TMFVGLPAAN AYYACAGTGD
ASRDQTKYGR TTDRARQVNS VDVVAHELGH GIFCHTPGGS RGITNETGGL NEGTGDIFGA
LAEHFVANPN DPADYLVGEE VNLSGRGPIR TMYDPSKNGD PNCWSTDIPR TRVHSAAGPL
NHWFYLAAEG SKPAGKPASP TCNGTDVTGI GLWQAGEIYY HALLRKTSGW TYTQARKATL
DATRELYPNS CAEFNVIKAA WNAVSVPAQG DPTCTTGTPT PTASPSAPGP SSSPTSPPGG
EPAAAPDIDG AKVEAHLEEL GRIAAANGGN RAHGTPGYRA SLDYVKGELD AAGYNTRIQQ
FNSGGKPGFN LIADLPDRED HDKVVMLGAH LDSVDIGPGI NDNGSGSAGI LEVALTYAAS
GAKGDKAIRF GWWGAEEDGL VGSKAYVTSL SAAERESITA YLNFDMIGSP NPGYFVYNDD
AKGDFITEAL EEGFAAEDVP SEGVSLRGRS DHAPFMAVGI PSGGTATLSL VPVMSQAQAA
KWNGKAGQPF DPCYHRNCDT VENISTAALD THTDVAAYAA WKLTGVHAAG GSAGSTHVTN
NTHFPIRDRS VVESPITVQR DGPAQAVREV KVDIVHSYRG NLEIHLLAPD GTEYLIKRPS
RLDRADDVKL TKPIDSSAEK TDGTWKLRVR DLHSGNIGTL RSWSLIF