Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4213 |
Symbol | |
ID | 5707951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4781834 |
End bp | 4783465 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641273632 |
Product | extracellular solute-binding protein |
Protein accession | YP_001538985 |
Protein GI | 159039732 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0562482 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0711224 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGGGA AATTCCTGAA GGTGGCGGTC GCGGCGACCG CCACCGCCAT GTTGGCCACC GCCTGCGGCG GCGGCAGTGA CGACTCGGAC GATGCTGGCG GTCAAGCCGG TGGCACGCTT CGCGTTTACG CCACAGAGCC GGCGTTCCTG CTGCCATCGG CCGCCAATGA CGAGCCGTCG ATCTACGTGA TCCGGCAGCT CTACCGTGGC CTGGTCAAGT ACAACGCCGA GAACAGCGCC GTCGAGAACG ACCTGGCCGA GTCGATCACC TCGGACGACC AGAAGCTCTG GACGATCAAG CTGAAGGATG GCTACACCTT CGACAACGGT GAGCCGGTCG ACGCCGAGTC CTTCATCCGG TCGTGGAACT ACGCTGCTTA CGGGCCGAAC GGCCAGAACA ACGCCTACTT CATGACGCGG ATCGCCGGTA TCGACGCCCT CCAGCCCAAG GACCCGGATG GCGAGGACGG CCCGAAGGAG GCTCCGGAGC CGACGGCCGA GACGCTGTCG GGTCTGAAGA AGGTCGACGA CCTGACCTTC ACCGTCGAGC TCAAGGAGCC GTTCTCCGGC TTCCCGACCG TGGTCGGCTA CTCGGGCTTC TTCCCGATGG CCAAGGCGTG CCTCGACGAC ACGGACAAGT GCAACGAGAC CCCGATCGGC AACGGCCCGT ACAAGATGGA CGGCGCCTGG GAACGCGACG TCCAGATCAA CCTGGCTCGC AGCGAGTCGT GGAAGGGTGA GCCGGGTAAG CCAGCGAAGA TCAACTACCG GATCTTCGCG GACGTCGGTT CCGGTTACTC CGCCTTCCAG GCCGGCGAGC TGGACGTGAT GTACACCCTG CCGCCGGAGC GCTTCAAGGA CGCCAAGGCC AGCTACGGCG ACCGTCTGTA CGAGCAGACG GGCGACAGCC TCAACTACGT CGGTATGCCG CTGTACAACG AGAACTTCAA GGACAAGCGG GTCCGCCAGG CGCTCTCGCT GGCGATCGAC CGGCAGTCCA TCGTGGACGC GGTGTTCGAC GGCCGGAACG CGCCGGCCAC GGGCTTCGTC GGGCCGACCT TCCAGGGTGC TCGCGAGGGC GTCTGCGAGT ACTGCAAGAA GGACGTCGAG AAGGCCCAGG CGCTGCTCGC CGAGGCTGGC GGTTGGAAGG GTGGCAAGCT GACCCTGTGG GCCAACGCCG GTGCCGGTCA CGACGCCTGG CTGCAGGCGG TCGGCGACCA GATCAAGGCC GCGCTGGGTA TCGACTACGA GCTGAAGGTC AACCTGCAGT TCCCCGAGTA CCTGGAGACC GCCAAGGGTC GGAAGTTCAC CGGTCCGTTC CGGCTCGGTT GGGGCCCGGA CTACCCGTTC CTGGAGACCT ACCTGGCTCC GGTCTACGGC GGTGGTAACG ACAACAACTA CTCCACCTTC AACAACCCCA ATTTCGACGC TCTGCTGAAG CAGGGTGACT CCGCCGCGAG CATCGACGAG GCAATCCCCT TCTACCAGAA GGGTGAGGAC ATCCTCGCCG AGGAGATGCC GGTTATCCCG ATCTACTGGC GTAAGGAGGC GGCGCTCTAC AGCGAGAACG TCGACAACTT CGTCTGGAAC CAGGTCTCGG GCGCCGACTA CGGTGCGACT TCGCTGAAGT AG
|
Protein sequence | MRGKFLKVAV AATATAMLAT ACGGGSDDSD DAGGQAGGTL RVYATEPAFL LPSAANDEPS IYVIRQLYRG LVKYNAENSA VENDLAESIT SDDQKLWTIK LKDGYTFDNG EPVDAESFIR SWNYAAYGPN GQNNAYFMTR IAGIDALQPK DPDGEDGPKE APEPTAETLS GLKKVDDLTF TVELKEPFSG FPTVVGYSGF FPMAKACLDD TDKCNETPIG NGPYKMDGAW ERDVQINLAR SESWKGEPGK PAKINYRIFA DVGSGYSAFQ AGELDVMYTL PPERFKDAKA SYGDRLYEQT GDSLNYVGMP LYNENFKDKR VRQALSLAID RQSIVDAVFD GRNAPATGFV GPTFQGAREG VCEYCKKDVE KAQALLAEAG GWKGGKLTLW ANAGAGHDAW LQAVGDQIKA ALGIDYELKV NLQFPEYLET AKGRKFTGPF RLGWGPDYPF LETYLAPVYG GGNDNNYSTF NNPNFDALLK QGDSAASIDE AIPFYQKGED ILAEEMPVIP IYWRKEAALY SENVDNFVWN QVSGADYGAT SLK
|
| |