Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4381 |
Symbol | |
ID | 5705072 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4952503 |
End bp | 4954134 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 641273803 |
Product | extracellular solute-binding protein |
Protein accession | YP_001539153 |
Protein GI | 159039900 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000255213 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTGGGA AACTCCTGAA GGTAACCGTC GCGGCGACGG CCACTGCCAT GTTGGTCACT GCTTGCACCG GCGGTAACAG CGACGACCCA GCGGAAACGA ACGGTCAATC CGGTGGCGAG CTACGCGTTT ACGCTTCGGA GCCGGCGTCC CTGGTACCGT CGGCCGCCAA TGATGAACCG TCGATCTATG TGATCCGTCA GCTCTACCGT GGCCTGATCA AGTACAACGC CCAAACCGGT GCGGCCGAAC ACGACCTGGC CGAGTCGATC GCGTCGGACG ACCACAAGCT TTGGACCATC ACAATCAACG GTGGCTACAC CTTCGACAAC GGTGAGCCGG TCGACGCCGA GTCCTTCATC CGGTCATGGA ACTACGCCGC TTACGGGCCG AACGCCCAGA GCAACGCCTA CTTCATGAAG CGAATCGCCG GGTTCGACGA TGTTACGGCC AAGGATCCGG ACGGTGATGG TCCGAAGACG GCCCCGGAGC CGAAGGCCAA GACGCTGTCG GGCCTGAAGA AGGTCGACGA CCTGACCTTC ACCGTCGAGC TCAAGGAGCC GTTCACTGAC TTCCGGACCA CGCTCGGCTA CTCAGGCTTC TTCCCGATGG CCCAGGCGTG CGTCGACGAT GAGGACGCGT GCAACGAGAC CCCGATCGGG AACGGCCCTT ACAAGATCGA AGGCGCCTGG GAGCGTGGCG TCCAGATCAA CCTGACCCGT AGTGACTCCT GGAAGGGCGA GCCGGGCAAG CCCGACAAGA TCAACTACCG GATCTTCGCG GACGAAGGCT CTGCGTACTC CGCCTTCCAG GCCGGCGAGC TGGACGTGAT GTACACCCTG CCGTCGGAGC GTTTCAAGGA CGCCAAGGCA AGCTACGGCG ACCGTCTGTA CGAGCAGCCA GGCGACAGCC TGAACTACAT CGGCATGCCG CTGTACAACG AGAACTTCAC GGACAAGCGG GTCCGTCAGG CGATCTCGCT GGCGATCGAC CGGCAGTCGA TCATCGACGC GGTGTTCGAC GGCCGGTACA CCCCGGCTAC CGGCTTCATC GCGCCGACCT TCCAAAACGT CCGCGAGGGC GTCTGCACGT ACTGCAGGAA GGACGTCGAG AAGGCCCAGG CGCTGCTCGC CGAGGCCGGT GGCTGGAAGG GCGGCAAGCT GGTCCTGTGG GCCAACGCCG GTTCCAGTCA CGAAGCCTGG CTGCAGGCAG CCGGCGACCA GATCAGGGCC GCCCTGGGCA TTGACTACGA GCTGAGGGTC AACCTGCAGT TCCCCGAGTA CCTGGAGGCT GCCGACAACC GGAAGTTCAC CGGCCCGTTC CGGCTCGGTT GGGGCCCGGA CTATCCGTCA CTGGAGACCT ACCTGGCTCC CCTGTATGGG ACCGGGGCCG ACAGCAACAG TTCCACCTTC AGCAACCCCG AGTTCGACCG CCTGATGAAG CAGGGTGACT CCGCCAGTTC CATCGAGGAG GCGGTCACCT TCTACCAGAA GGGTGAGGAT ATCCTGGCGG AGGAGCTGCC GGTTATCCCG ATGTTCTGGC GCAAGGTGGG GGCGCTCTAC AGCGAGAACG TCGACAACTT CGTCTGGAAC CAGTTCTCGG GCGCCGACTA TGGTGCGACC TCGCTGAAGT AG
|
Protein sequence | MRGKLLKVTV AATATAMLVT ACTGGNSDDP AETNGQSGGE LRVYASEPAS LVPSAANDEP SIYVIRQLYR GLIKYNAQTG AAEHDLAESI ASDDHKLWTI TINGGYTFDN GEPVDAESFI RSWNYAAYGP NAQSNAYFMK RIAGFDDVTA KDPDGDGPKT APEPKAKTLS GLKKVDDLTF TVELKEPFTD FRTTLGYSGF FPMAQACVDD EDACNETPIG NGPYKIEGAW ERGVQINLTR SDSWKGEPGK PDKINYRIFA DEGSAYSAFQ AGELDVMYTL PSERFKDAKA SYGDRLYEQP GDSLNYIGMP LYNENFTDKR VRQAISLAID RQSIIDAVFD GRYTPATGFI APTFQNVREG VCTYCRKDVE KAQALLAEAG GWKGGKLVLW ANAGSSHEAW LQAAGDQIRA ALGIDYELRV NLQFPEYLEA ADNRKFTGPF RLGWGPDYPS LETYLAPLYG TGADSNSSTF SNPEFDRLMK QGDSASSIEE AVTFYQKGED ILAEELPVIP MFWRKVGALY SENVDNFVWN QFSGADYGAT SLK
|
| |