Gene Information |
Locus tag | Sare_1661 |
Symbol | |
ID | 5703431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 1912681 |
End bp | 1914216 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641271165 |
Product | extracellular solute-binding protein |
Protein accession | YP_001536540 |
Protein GI | 159037287 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
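The length fields in this record are consistent with the coordinates if the start and end positions are read as inclusive, 1-based positions (an assumption here; the record does not state its coordinate convention). A minimal Python sketch of that arithmetic follows. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1912681, 1914216)
print(length)                     # 1536
print(protein_length_aa(length))  # 511
```

Both values agree with the Gene Length (1536 bp) and Protein Length (511 aa) fields.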
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.226847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCAGCA TCAGATTTCG GGGGGCCGGC GTCGCGCTGG CTCTGACACT CGGTCTGCTG GCGGGATGTA GCGCGGGGGA GGGCGTCGAC GTCGACGGGT CGGGCCAGTC TGGTGCCGGC GGTGTCCTCA CTGTCGCGAT CAGTGGGGAA CCGGATCAGC TGGACCCGCA TCGGACCTCG GCCTACCACA GCTTCCAGGT GCTCGAGAAC GTCTACGACA CGCTCGTGGA GCCGGACGCG AACCTGGCGA TGAAGCCGGC CCTGGCGACG GAGTGGAGCA CCAGCGAGGA CCAGTTGACC TGGACGTTCA CCCTCCGTAA GGGGGTGACG TTCACCGACG GTTCGCCGCT TACCGCCGAG GACGTGGTCT ACTCGTACAC CCGGATCATC GACGAGAAGT TGAATGCGGC GTACCGGTTT TCCACGGTGG AGTCGGTGAC GGCCCCCGAC CCCGGTACCG TCGTCGTGAC GCTGACCGCG CCCACCCCGA ACCTGCTCGC CAGCCTCGGC GGCTTCAAGG GAGTGGCGAT CGTCAAGAAG TCCAACGTCG AGTCGGGCGC GGTGAAGACC GAGCCGATCG GTAGTGGTCC GTTCACTGTG GCCTCCTACA CTGCCGGGGA CAGCATCAAG CTGGTGCGCA ATGACAGCTA CTGGGGCACC AAGCCCAAGC TGGACGGGGT GACCTTCACC TTCGTCAAGG ACCCGACGGT GGCCCTGCAG AACCTGCGCG GTGGTGAGGT GCAGTGGACC GACAACCTGC CCCCGCAGCA GGTGCCGGCG CTTCGGGAGG ACGACGAGCT CGTCGTGCGT TCGGTGCCGT CGAGCGACTA CTGGTACCTG GCCCTCAACC AGTCCCGTGA GCCCTACGAC AACGTCGAGG TACGCCGGGC GGTCGCCTTC GCGCTCGACC GAGCGGCGAT CACCAAGGCC GCCAAGTTCG GGCTGGCGAC GGTCAACCAG ACCGCCATCC CCGAGGACAG CGCCTTCTAC TACGACTACG CGCCGTACCA GCGGGACCCG GCGCAGGCGA AGCAACTGCT GGCCGCGGCC GGCGTGACGG ATCTGACCAT GGACCTGATG GTCACCAACG AGTACCCGGA GACAGTCACC GCAGCGCAGG TCATCGCCGC GCAGCTCAAG GACGTCGGCA TCACTGTCAC GATCCGTACG TTGGATTTCG CCCAGTGGCT CGACGAGCAG GGCAAGGGAA ACTTCGACTC GTTCATGCTC GGCTGGCTGG GCAACATCGA CCCCGACGAG TTCTACTACG CCCAGCACCA CAGCCAGGGC ACCTTCAACT TCCACGGATA CCGCAACCCA GCCGTGGACA GCCTGCTCGA CCAGGCCCGG ACCGAGACCG ACCAGGCCGC GCGTAAGCGG CAGTACGAGC AGGTGGCGAA GCGGATCGTC GACGACGCCA GCTACCTCTA CCTCTACAAC CCGGATGTGG TGCAGGGCTG GTCGCCGCAG GTCAGCGGCT ACCAGGTCCG TGCCGACCGG GCGATTCGGT TCCGCGACGT CAGCCTCGAC CGGTGA
Protein sequence | MSSIRFRGAG VALALTLGLL AGCSAGEGVD VDGSGQSGAG GVLTVAISGE PDQLDPHRTS AYHSFQVLEN VYDTLVEPDA NLAMKPALAT EWSTSEDQLT WTFTLRKGVT FTDGSPLTAE DVVYSYTRII DEKLNAAYRF STVESVTAPD PGTVVVTLTA PTPNLLASLG GFKGVAIVKK SNVESGAVKT EPIGSGPFTV ASYTAGDSIK LVRNDSYWGT KPKLDGVTFT FVKDPTVALQ NLRGGEVQWT DNLPPQQVPA LREDDELVVR SVPSSDYWYL ALNQSREPYD NVEVRRAVAF ALDRAAITKA AKFGLATVNQ TAIPEDSAFY YDYAPYQRDP AQAKQLLAAA GVTDLTMDLM VTNEYPETVT AAQVIAAQLK DVGITVTIRT LDFAQWLDEQ GKGNFDSFML GWLGNIDPDE FYYAQHHSQG TFNFHGYRNP AVDSLLDQAR TETDQAARKR QYEQVAKRIV DDASYLYLYN PDVVQGWSPQ VSGYQVRADR AIRFRDVSLD R
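The sequences are displayed in blocks of 10 characters, so the spaces have to be removed before any analysis. The sketch below shows one way to clean a blocked sequence, compute its GC fraction, and translate it. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons, which this sketch ignores). The helper names are hypothetical, and only the first 30 bases of the gene are used as input.

```python
# Codon table built in the conventional TCAG order; the amino-acid string is
# the standard assignment, which table 11 shares (assumption stated above).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
excerpt = "ATGTCCAGCA TCAGATTTCG GGGGGCCGGC"
print(translate(excerpt))  # MSSIRFRGAG
```

The output matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence gives a value that can be compared with the GC content field.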