Gene Information |
Locus tag | Sare_0178 |
Symbol | |
ID | 5706335 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 190378 |
End bp | 191529 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641269704 |
Product | extracellular ligand-binding receptor |
Protein accession | YP_001535104 |
Protein GI | 159035851 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
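The coordinate, length, and protein-length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 190378
end_bp = 191529
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)  # 1152 0 383
```

Under these assumptions the computed values agree with the Gene Length (1152 bp) and Protein Length (383 aa) fields.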
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00242186 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGGCGTA AGTACGTACG GGCGCTCGGC GCCATGGGGT TGTCGGCAGC TCTCATTGCT GCGGCGGGCT GCCAGGCGGC GGAGGACGAC ACCGCTGGCG GCAGCGGTGA CTGCGGTGGC AAGATCGCCA TGTTCGGTGC CTACAGCGGC CCGAACTCGG GTCTGGCGAT CCCGGCGCTC AAGAGTGCCC AGCTTGCGGT GAAGCAGCAT AACGAGGCGA ACCCGGACTG CGAGGTGACC CTGCAGGAGT TCGATACCAA GGGTGACCCG ACCGAAGCCA CACCGGTCGC GAACAGGGTC GCCGGCGACG CGTCATTCCT GGGCGTCATC GGTGGTGCGT TCTCCGGTGA GAGTAAGGCG ACGATGGATG TCTACGAGGC GGCCGAGATG GTGATGGTCA GCCCGTCGTC GACCGCGATC GAGTTGACCG CGGGCGGTAA CGAGGTGTTC CACCGGGTGG TCGGGAACGA CGCCGTCCAG GGGGCTGCCG CCGCCGTCTA CCTTCGGGAT GTCGTGCGGG CCAGGAAGGT CTTCGTGATC AATGACGGCA CCACCTATGG CGCCGGCATC ACCGACGAGC TGACCCGGGC CCTCGGCGAG TTGGCGGGCG GCACCGACCA GGTGCAGGAA AAGCAGGTCA ACTTCGCCGC CACCATTTCG AAGATCAAGG CTGCCGCGCC GGACGCGATC GCGTACGGCG GCTACTCAAA CGAGGCGGCT CCGCTGGTGA AGCAGATGCG GGAGGCCGGA GTCACGGCCA CCTTCCTCGG CCTCGACGGG ATCTACGACC CGTCCTTTCC GGAGGGTGCC GGGGCCAGCG CCGAGGGTGC GATCGTGACC TGTCTGTGTC TGCCGTCGGA CAAGGCAGGC GGAACCTTCG CGGCCGACTT CGAGGCGGAG TACGGAGTGC CTCCGGGTCC CTTCGGCGCC GAGGGTTTCG ACGCCGCCAA CGTGTTGATC GAGGGCCTGG CCGAGGGCAA CACCACGCGC AAGGACCTGC TGGCGTGGGT CGACTCCTAC GACAAGGAGG GAGTCTCGAA GTCCCTCAAG TTCGATGAGA ACGGTGACGT GGACAAGTCC CGGGTGGTGA CGTGGGCCTA CGAGATCAAG GGCGGTGAGA TCACGGCCCA GCAGGAGATC AAGCTCAGCT GA
|
Protein sequence | MRRKYVRALG AMGLSAALIA AAGCQAAEDD TAGGSGDCGG KIAMFGAYSG PNSGLAIPAL KSAQLAVKQH NEANPDCEVT LQEFDTKGDP TEATPVANRV AGDASFLGVI GGAFSGESKA TMDVYEAAEM VMVSPSSTAI ELTAGGNEVF HRVVGNDAVQ GAAAAVYLRD VVRARKVFVI NDGTTYGAGI TDELTRALGE LAGGTDQVQE KQVNFAATIS KIKAAAPDAI AYGGYSNEAA PLVKQMREAG VTATFLGLDG IYDPSFPEGA GASAEGAIVT CLCLPSDKAG GTFAADFEAE YGVPPGPFGA EGFDAANVLI EGLAEGNTTR KDLLAWVDSY DKEGVSKSLK FDENGDVDKS RVVTWAYEIK GGEITAQQEI KLS
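The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch for translation table 11 (the bacterial, archaeal and plant plastid code). Table 11 uses the same codon-to-amino-acid assignments as the standard code but accepts alternative start codons such as GTG, which is read as methionine at the first position. The function name `translate_cds` and the constants are hypothetical helpers written for this example; they are not taken from the record or from any library.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    if not seq or len(seq) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # any valid initiator codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGGCGTA AGTACGTACG GGCGCTCGGC"))  # MRRKYVRALG
```

The printed prefix matches the first ten residues of the protein sequence in this record. Passing the full gene sequence to the same function should reproduce the whole protein, provided the sequence is copied without errors.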