Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0075 |
Symbol | |
ID | 5707209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 86607 |
End bp | 89450 |
Gene Length | 2844 bp |
Protein Length | 947 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641269601 |
Product | peptidase M28 |
Protein accession | YP_001535001 |
Protein GI | 159035748 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG2234] Predicted aminopeptidases [COG3227] Zinc metalloprotease (elastase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.11414 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.170085 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGATCGTGA CGGTGCAGGC TGCCGGGGCG ATCATCGCCC TCGGGTACAC GTCCGCCAGC GAGCTGCCGG AGCTGGACGC GCTCGAACGC CTGGCGATCA GCACTGCCGC CGCCGATCAG TATGTCTCCG GCACCTCCAC CGGGTTTGTG AAAGGTGAGG ACGAGAAGTA CAGCCGTCAG CAGGTGACCC CCGCCTCGCG CGGCCTGACG TACGTCACCT ACTCCCGGAC TTACAAGGAC CTACCGGTAT TCGTCGGGGG TGAGGCCGTG GTGGTGACCG ACAGCAAGAG CACCGTCCGC AGCAGCACCG CTGGCTCGGG GCCGCTGCAG GTGTCCACGT CGGCGAAAGT GAGCGCCAAG AAGGCCAAGG CAGTGGCGTT GGGCAAGCAC CAGGGCACCG TACGAGGCGA GCCGACGCTG GGTGTGATCG CCGAAGGTGA TGGCCGGCTC GTCTGGGAGG TCGTCGTGAG CGGCGACGGC GAGCACGGGC CGAGTGTGGC GCACGCATTC GTGGATGGGC AGACCGGCGC TTACGTTGGC TCCTGGGACG AGGTTGCTCG AGGCACGGGC AACGGACACT ACAACGAGCA GGTTCACCTC GACACCACCG GCAGCGCCGG TCGCTTCGAA TTGCGTGATC CGACGCGCGG CAATATGCGA ACTCTGAACG ACGCCGCGTC CGGCACTCCA TTCACCGACG CCGACGACGT GTGGGGCAAC GGCGTCGGCT CTGACCTGGT CACCGGAGCG GTCGACACCC AGTATGCGGG CGCGGCGGTG TGGGACATGC TGGAGGAGAA GTTCTCGCGC AATGGCATCG ATGACGCGGG TAGGACCGCG ACCATGTTCG TCGGCCTTCC TGCCGCCAAT GCCTACTACG CCTGCGCCGG TACCGGCGAT GCGAGCCGCG ACCAGACCAA GTATGGCCGC ACCACCGACC GTGCCCGACA GGTCAACTCA GTAGACGTGG TTGCGCACGA GCTGGGCCAC GGCATCTTCT GCCACACTCC CGGCGGCAGC CGTGGCATCA CGAACGAGAC CGGCGGCCTC AACGAGGGTA CGGGCGACAT TTTCGGCGCG CTCGCCGAGC ATTTCGTTGC CAATCCGAAC GACCCAGCCG ACTACCTGGT CGGCGAGGAA GTCAACCTGT CCGGGCGTGG CCCAATTCGT ACGATGTACG ATCCATCCAA GAACGGCGAC CCGAACTGTT GGTCAACCGA CATCCCGAGA ACTCGGGTGC ACTCGGCGGC TGGACCGCTC AACCACTGGT TCTACCTCGC GGCTGAGGGC TCCAAGCCCG CTGGCAAGCC GGCCAGCCCC ACCTGCAACG GCACTGACGT TACTGGCATC GGGCTGTGGC AGGCCGGTGA GATCTACTAC CACGCGCTGC TGCGCAAGAC CTCGGGCTGG ACGTACACGC AGGCACGGAA GGCGACCCTG GACGCCACTC GAGAGCTGTA CCCGAACAGC TGCGCCGAAT TCAACGTGAT CAAGGCAGCG TGGAACGCGG TGAGCGTCCC GGCGCAGGGT GACCCCACCT GCACGACCGG CACGCCAACG CCGACGGCCT CGCCGTCAGC GCCAGGTCCC TCATCGTCGC CGACATCGCC ACCCGGTGGA GAGCCGGCGG CAGCACCGGA CATTGATGGC GCCAAGGTCG AGGCACACCT CGAGGAACTC GGCCGAATCG CCGCCGCGAA CGGCGGTAAC CGGGCACACG GCACCCCGGG GTACCGGGCT TCGCTCGACT ATGTGAAGGG CGAACTCGAC GCCGCGGGCT ACAACACCCG GATCCAGCAG TTCAACTCCG GCGGCAAACC CGGGTTCAAC CTGATCGCCG ACCTGCCCGA CCGAGAAGAC CACGACAAGG TGGTCATGCT CGGCGCGCAC CTGGACAGCG TCGACATTGG GCCTGGCATC AACGACAACG GCAGTGGCTC TGCTGGCATC CTCGAGGTCG CTCTGACCTA CGCCGCCAGC GGCGCGAAAG GCGACAAGGC GATTCGCTTC GGCTGGTGGG GGGCAGAGGA GGACGGCCTG GTCGGGTCCA AGGCGTACGT GACGTCGTTG TCAGCCGCGG AAAGGGAATC GATCACCGCA TACCTGAACT TCGACATGAT CGGTTCGCCG AATCCGGGGT ACTTCGTCTA CAACGACGAC GCCAAGGGTG ACTTCATCAC CGAGGCGCTC GAGGAGGGCT TCGCTGCCGA GGATGTTCCG TCCGAGGGCG TCAGCCTTCG CGGCCGGTCG GATCATGCCC CCTTCATGGC GGTGGGCATC CCCAGCGGCG GCACCGCCAC GCTGAGCCTC GTGCCGGTGA TGAGCCAGGC CCAGGCCGCC AAGTGGAACG GCAAGGCCGG GCAGCCGTTT GACCCCTGCT ACCACAGAAA CTGCGACACG GTGGAGAACA TCAGCACCGC TGCCCTGGAC ACGCACACGG ATGTGGCCGC GTACGCCGCG TGGAAGTTGA CCGGTGTGCA CGCCGCTGGC GGATCAGCGG GCTCCACGCA CGTCACCAAC AACACCCACT TCCCCATCCG TGACCGGTCG GTCGTCGAGT CCCCCATTAC GGTGCAGCGC GACGGACCTG CGCAGGCCGT TCGTGAGGTG AAAGTCGATA TCGTGCACTC CTACCGCGGC AACCTGGAGA TCCATCTGCT GGCACCCGAC GGCACCGAAT ACCTCATCAA GCGTCCGAGC CGCCTCGACA GGGCGGACGA CGTCAAGCTA ACAAAGCCGA TCGACTCCTC GGCGGAGAAA ACCGATGGAA CCTGGAAACT GCGGGTACGT GACCTGCACT CTGGCAACAT TGGCACGCTC CGCTCCTGGA GCCTGATTTT CTAG
|
Protein sequence | MIVTVQAAGA IIALGYTSAS ELPELDALER LAISTAAADQ YVSGTSTGFV KGEDEKYSRQ QVTPASRGLT YVTYSRTYKD LPVFVGGEAV VVTDSKSTVR SSTAGSGPLQ VSTSAKVSAK KAKAVALGKH QGTVRGEPTL GVIAEGDGRL VWEVVVSGDG EHGPSVAHAF VDGQTGAYVG SWDEVARGTG NGHYNEQVHL DTTGSAGRFE LRDPTRGNMR TLNDAASGTP FTDADDVWGN GVGSDLVTGA VDTQYAGAAV WDMLEEKFSR NGIDDAGRTA TMFVGLPAAN AYYACAGTGD ASRDQTKYGR TTDRARQVNS VDVVAHELGH GIFCHTPGGS RGITNETGGL NEGTGDIFGA LAEHFVANPN DPADYLVGEE VNLSGRGPIR TMYDPSKNGD PNCWSTDIPR TRVHSAAGPL NHWFYLAAEG SKPAGKPASP TCNGTDVTGI GLWQAGEIYY HALLRKTSGW TYTQARKATL DATRELYPNS CAEFNVIKAA WNAVSVPAQG DPTCTTGTPT PTASPSAPGP SSSPTSPPGG EPAAAPDIDG AKVEAHLEEL GRIAAANGGN RAHGTPGYRA SLDYVKGELD AAGYNTRIQQ FNSGGKPGFN LIADLPDRED HDKVVMLGAH LDSVDIGPGI NDNGSGSAGI LEVALTYAAS GAKGDKAIRF GWWGAEEDGL VGSKAYVTSL SAAERESITA YLNFDMIGSP NPGYFVYNDD AKGDFITEAL EEGFAAEDVP SEGVSLRGRS DHAPFMAVGI PSGGTATLSL VPVMSQAQAA KWNGKAGQPF DPCYHRNCDT VENISTAALD THTDVAAYAA WKLTGVHAAG GSAGSTHVTN NTHFPIRDRS VVESPITVQR DGPAQAVREV KVDIVHSYRG NLEIHLLAPD GTEYLIKRPS RLDRADDVKL TKPIDSSAEK TDGTWKLRVR DLHSGNIGTL RSWSLIF
|
| |