Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2145 |
Symbol | |
ID | 5706963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2467385 |
End bp | 2469190 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641271630 |
Product | extracellular solute-binding protein |
Protein accession | YP_001537001 |
Protein GI | 159037748 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.377347 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0104158 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTCGT CCCTACCGGT CCTCACCGGC GACGACACAC GCCCGCACCG GGGCGGCACG GTGACCTGGG CGTGCGCGCC CGGTTTCCCA CCCGCCGTGA TCTTCCCCTT CACGCCCGCC GAACGGATGG GCACCCGCAA CATCTACGAG TTCCAGATGC TGATGTACCG CCCGCTGTAC TACTTCGGCA GCAAGGGCAC CCCAGAGGTC GACTACGAGC AGAGCATCGG CGAGCCACCG CGGTGGAGCG ACGACGGGCT CACCGTCCGA ATCCGGATCA AGCCGTGGAA GTGGTCCAAC GGCGAGACCC TCTGCGCGGA CAACGTGCTG TTCTGGGTGA ACCTGATGAA GGTCAAGGGC GACCGTTACG GCGAGTACGT CCCGGGCTAC TTCCCGGACA ACTGCACCGA CTACGGCAGG GATGGTGAGG ACAGCGTCTG GTTCACCTTC GACAAGCCGT ACTCCCGCAA GTGGGTCCTG ATGAACCAGC TCAGCACCAT CACCCCGCTA CCCCGGGCGT GGGACCGGAC CGCCGACGGA CCGGCAGACG CCTCCCGCGG CCTGGCCGAC GTCGCCGCGG TCTACGACTA CCTCATGGCC GAGCAGGGCG ACATCGTCGC GGAAAGCAAC AGGCATCGCA CTCGCTGGGC CGACAGCCCT GTCTGGAGCG TGGTCGACGG CCCCTGGCGG CTGAAAAGCT ACACCCTGGA GGGAGTCGTC ACCTTCGTGC CCAACCAGCA CTACTCCGGG CCGAATCGGC CCTATGTGGA CGAGTTCCGA CAGGTGCCCA CGATGTCCGA CGACGAGGAG TACCGCATGC TCCAGGCCGG GCCGCGAGGC CCGGACTCCG TTCAGGTCGG GTACCTGCCG CTGAGCTTCA CCACCGAGCC GACCGACGAT CCCACCCGAG GTGGGGCCAA CCCGCTCGCC CCGGACTATC GGCTGGTGCC TCAGGTCGCG TTCTGCATCC GGTACTTCTG CCTGAACTAC AACAACCCGA CCGTCGCCGG GCGGATTTTC ACCCAAACGT ACTTCCGGCA GGCGCTACAG TGCACCCTGG ACCAGGACGC GGCGGTCCGC GACATCTACC ACGGATACGC GTACCGGCAG AACGGTCCGG TGCCGATGGT GCCGGCGACC GACCTGGTCT CCCCACGGCA GCGTGCCGGC GCCTGGCCGT TGCCGTTCGA CCCGGACCGG GCCCGCCGGC TACTACAGGA CAACGGCTGG GACACCAGCA CCACACCGGC GGTGTGCGTC CGGCCCGGTA CCGGTCCCGG CGCGGCCGGC GCGGGTATCC CCGCCGGCAC CCGGCTCAGC GTCCTGCTCC GGTACGTGGA AGGTCGGCCA GCGCTGACCC GGCTGATGAC CACCTTCCAA CGCGACGCCG CCACCGCCGG GATCGAGATC CGGTTGGAGG AGGTGTACGG CTCCGTCCTG GTGGCTGAGG ACGCACCCTG CGTGCCAAGT CCGGACACCC CCTGCCGGTG GGAGATGTGC TGTTGGAACG GCGGCTGGGC CTACCACCAT CCGACCGGCG AGATCCTCTT CCGCACCGAC GCCGGCGGCA ACTTCGGCCA CTGGAGTGAC CCTGTTACCG ACGAACTCAT CGAACGCACC GTCACCAGCG ACGACCCGGC CGTCCTGTAC GAGTACCAGG ACCACATCGC CGAACAGGTC CCGGTCATCT TCACCCCCAA CTTCCCGATC CGCCTCTTCG AGGTCTCCAG TGACCTGCGG GGATTCGAAC CGGTCAACCC GTTCGGCATG ATCAACCCGG AGAACTGGTA CTACGTCGAC CACTAG
|
Protein sequence | MTSSLPVLTG DDTRPHRGGT VTWACAPGFP PAVIFPFTPA ERMGTRNIYE FQMLMYRPLY YFGSKGTPEV DYEQSIGEPP RWSDDGLTVR IRIKPWKWSN GETLCADNVL FWVNLMKVKG DRYGEYVPGY FPDNCTDYGR DGEDSVWFTF DKPYSRKWVL MNQLSTITPL PRAWDRTADG PADASRGLAD VAAVYDYLMA EQGDIVAESN RHRTRWADSP VWSVVDGPWR LKSYTLEGVV TFVPNQHYSG PNRPYVDEFR QVPTMSDDEE YRMLQAGPRG PDSVQVGYLP LSFTTEPTDD PTRGGANPLA PDYRLVPQVA FCIRYFCLNY NNPTVAGRIF TQTYFRQALQ CTLDQDAAVR DIYHGYAYRQ NGPVPMVPAT DLVSPRQRAG AWPLPFDPDR ARRLLQDNGW DTSTTPAVCV RPGTGPGAAG AGIPAGTRLS VLLRYVEGRP ALTRLMTTFQ RDAATAGIEI RLEEVYGSVL VAEDAPCVPS PDTPCRWEMC CWNGGWAYHH PTGEILFRTD AGGNFGHWSD PVTDELIERT VTSDDPAVLY EYQDHIAEQV PVIFTPNFPI RLFEVSSDLR GFEPVNPFGM INPENWYYVD H
|
| |