Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0701 |
Symbol | |
ID | 5706301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 779222 |
End bp | 780493 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641270219 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001535611 |
Protein GI | 159036358 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.161591 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.175564 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTCCCC AACAGACCAC GGTTCGGTCT CGGTGGCGAC TGGTGATCGC CGCCGCTCTG CTGATGGCCG TCACCGCATG CACCGGGTCC AGCCTCGGCG GCTCGGACAA CAGCGGTGAG GAGATCAGCA TCGGTCTGAT CGTGCCGAAG TCCGGCCCCT ACAAGGCGAT CGGCGACGAC CAGGCCGCCG GGTGGCGGCT GGCGCTGGAA AAGCTGGGTG GTGAGCTGGG CGGTCGGCCG GTGCGGGTGA TCGAGGCCGA CGAGGGCGAC GGCAAGGCCA CCGCCCTCGC GTCGGCGCGC AAACTGATCG AGCAGGACAA GGTCCTCGCC CTGGTGGGCG GGGCGACCGC GGACACGGTG CAGACGGTGT ACCCGCTGTT GAAGGAGTCC GGGGTACCCC TGATCGGCAT CGGCGGACGG CCATCGACCG TGGATGACCC GACCTACCTG TGGTCGACGT CGTGGTTGAG TCAGGAGACC GGTGCGTCGA TCGCTGACTA TCTGCGTCAG GAGGTGGGCG AGGGCCGGGT GTGGGTCATC GGCCCGGACT ACATCGGGGG GCACGACCAG ATCGGCGGGT TCGTCGACGC GTTCCGGAGA GCCGGCGGAA AGCTCGCCAA CCCGGGTGAG GAACCGACGT GGACGCCGTG GGCGCCAGCC CCGACAACCG AGTTCTCTCC GTACCTGACG AAGATCAAGG ATTCTGGTGC GGCCGCAGTT TACACCTTCT ACGCCGGCAC CAGCGCCGTC GAGTTCGTCA AGCAGTACCG GCAGTACGGC ATCACCACGC CGCTGTACGC TTCGGGCTTT CTTACCGAAG GCCCCTCGTT GACGGCGCTC GGCGAACAGG CCAAGGGCAT CTACACGGCG TTGAACTACT CCACCACCCT GGATAACGCC GCGAACCGGG ACTTTGTACG TCGATTCGCG GCGGCCAACG ATGGAAAGCT GCCCAATCTC TACCACGTGT GCGCGTGGGA CGCAGCGCTC GTACTGGACA AGGCCATCGC CGAAGCGCTC GCCCACCCGA ACGCGCCGGC GGCCACGTCG GACGGCCAGT CCGCCCCGGC CGCGGAGAGC GGTGAGCTGA CCTCGCAGTC GCTGACGGCG GCGTTGTCGC GGGTCGGGTC GATCGACTCA CCCCGGGGGG CCTGGCAGTT CGGGCCGGTG AACCATACCC CGGTGCAGGC CTACTACCTG CGCGAGGTCG CCCAGGACGG ACAGGTGTGG GTCAACCGCA CCGTGCAGAC ACTGACCACT CTCGGCTCCT GA
|
Protein sequence | MTPQQTTVRS RWRLVIAAAL LMAVTACTGS SLGGSDNSGE EISIGLIVPK SGPYKAIGDD QAAGWRLALE KLGGELGGRP VRVIEADEGD GKATALASAR KLIEQDKVLA LVGGATADTV QTVYPLLKES GVPLIGIGGR PSTVDDPTYL WSTSWLSQET GASIADYLRQ EVGEGRVWVI GPDYIGGHDQ IGGFVDAFRR AGGKLANPGE EPTWTPWAPA PTTEFSPYLT KIKDSGAAAV YTFYAGTSAV EFVKQYRQYG ITTPLYASGF LTEGPSLTAL GEQAKGIYTA LNYSTTLDNA ANRDFVRRFA AANDGKLPNL YHVCAWDAAL VLDKAIAEAL AHPNAPAATS DGQSAPAAES GELTSQSLTA ALSRVGSIDS PRGAWQFGPV NHTPVQAYYL REVAQDGQVW VNRTVQTLTT LGS
|
| |