Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2073 |
Symbol | |
ID | 5703284 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 2382400 |
End bp | 2383953 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641271559 |
Product | extracellular solute-binding protein |
Protein accession | YP_001536930 |
Protein GI | 159037677 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.610417 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.462432 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTACCC CTGTCAGATC CCGCCTGCTC CGGCGCGGGC TGCTGCCCCT CACCATCGCC GCCCTGCTGC TGGCCGGCTG CGGCACCGAC ACCACGACCG GTAGTTCCGC CGACGAGCCC GGCACGCCGG TCGACGGCGG CACCCTGCGC TACGTCGTAC CGGGATCGCC GGCGACCGCG AGCAACGACC CACATGGCGG GCTCGGCAAC GAGTCCGACC TCATGCGCTT CGCGCTGACC TACGACGTGC TCACCGTGCC CGGCGCCGAC GGCACCCCGC AGCCGCGCCT GGCCCAAACG TGGAAGGCGA ACCAGAGCCT GGACCGCTGG ACGTTTCACC TGCGCGAGGA CGCCACCTTC ACCGACGGCC AGCCGGTACG CGCCAAGGAC GTGCTCTACT CGCTGACCCG GATAGCGGAC AAGGCCGCCG AGAACTACGG CCGACTGGCC GACTTCGACA TGGCCGCCGC CAGCGCACCA GACGACCACA CGGTGGTGTT GGCGACCCGG GCGCCGATGG CCGAAGCACC GAAGGCGCTG GAATCGATCA GCTTCGTCGT TCCCGAGGGC AGCACGGACT TCGCCGAGCC GGTCCGCGGC TCAGGACCGT TCCGGGTGAC CGAGACCGAC GCCCAGACCG CCGTACTCCT GCGAAACGAC GACTGGTGGG GCGAACGACC GCACCTGGAC CGGATCGAGA TCCGGGCCGT CGCCGACCCG CAGGCTCGCG CCGCCGCCGT GACCTCCGGC CAGGCGGACG TCGCCGGAAG CGTCAGCCCG GCGGCGGTCA AAGCCGCCGA GGCCGGCGGT GACGTGCAGG TGGTCCGCCG CAAGGGCGTG ACCGAGTACC CGATCATCAT GCGCCTGGAC TCCGCACCGT TCGACGATCC ACGAGTGCGG GAGGCGTTCC GTCTCGCGAC CGACCGGCAG GCCCTCGTCG ACACGGTGTT CCTCGGATAC GGCCAGATCG CCAACGATCT GCCCACCCCG TACGACCCGT CGTACCCGCA GGATCTGACG CAGCGCACCC GGGACCTGGA CCGGGCCAGG GAACTACTCG AGCAGGCCGG ACACGCGAAC GGGCTGACGC TGACCCTGCA CACCACGACG TCGTACCCCG GCATGGACAC CGCGGCCACC CTGTGGGCCA GGCAACTCGC CGACGTCGGC GTACAAGTCG ACGTGAAGGT GGAGCCAGCC GACACCTACT GGACCGCCAT CTACGCCAAG AAGGACTTCT ACGTCGGCTA CTACGGCGGC ATCTCCTTCC CCGACCTGGT ACGCGTCGGT CTGCTGGCCG CCTCGCCGAC CAACGAGACC GCCTGGCGCA ACGCGTCGTT CGACGCCGAG TTCAACGCCG CCATGGGCAT CCTGGACCCG GCCGAGCGCA ACACCCGACT GGCCCGTATC CAGCAGGAGC TGTGGCGCGA CGGCGGGTAC GTGGTGTGGG GCGTCGGTGA TGGGTTGGAC CTGACCGTCC CCGGTGTGCA CGCTCTGCCC GACGGTCCCG GCTTCCAGCG GATGTTCATC GAACGCGCCT GGAAGACGAG GTGA
|
Protein sequence | MPTPVRSRLL RRGLLPLTIA ALLLAGCGTD TTTGSSADEP GTPVDGGTLR YVVPGSPATA SNDPHGGLGN ESDLMRFALT YDVLTVPGAD GTPQPRLAQT WKANQSLDRW TFHLREDATF TDGQPVRAKD VLYSLTRIAD KAAENYGRLA DFDMAAASAP DDHTVVLATR APMAEAPKAL ESISFVVPEG STDFAEPVRG SGPFRVTETD AQTAVLLRND DWWGERPHLD RIEIRAVADP QARAAAVTSG QADVAGSVSP AAVKAAEAGG DVQVVRRKGV TEYPIIMRLD SAPFDDPRVR EAFRLATDRQ ALVDTVFLGY GQIANDLPTP YDPSYPQDLT QRTRDLDRAR ELLEQAGHAN GLTLTLHTTT SYPGMDTAAT LWARQLADVG VQVDVKVEPA DTYWTAIYAK KDFYVGYYGG ISFPDLVRVG LLAASPTNET AWRNASFDAE FNAAMGILDP AERNTRLARI QQELWRDGGY VVWGVGDGLD LTVPGVHALP DGPGFQRMFI ERAWKTR
|
| |