Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_1643 |
Symbol | |
ID | 5058102 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 1872285 |
End bp | 1873256 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640473916 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001158486 |
Protein GI | 145594189 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.010471 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATCCA TCGTTAACAA GAGGGTCCTG GCGGGCGTCT CGCTGTCCAC GGTGGCGGCT CTCGCGCTCA CCGCGTGCGG TGGGACCAAG ATCGAGTCGA CGGACCCCGC CGAAGCGGGC GACTGCGGCA CCTTCACCAT CGCGATCAAC CCCTGGGTGG GGTACGAGGC GAACGCGGCC GTCATCGCCC ACGTCGCCGA GACCGAACTC GGCTGCAAGG TCGTCAAGAA GGATCTCAAG GAGGAGATCG CCTGGCAGGG CTTCGGCACC GGTCAGGTGG ACGCGATCGT GGAGAACTGG GGCCACGACG ACCTCAAGAA GAAGTACATC GAGGATCAGA AGACCGCGGT GAACGCCGGT TCGACCGGTG TCGAGGGTGT CATCGGCTGG TACGTGCCGC CATGGATGGC CGAGGAGTAC CCCGACATCA CCGACTGGAA CAACCTGAAC AAGTACGCCT CCCTCTTCGA GACCACGGAG TCCGGCGGCA AGGGACAGCT GCTCGACGGT GACCCGTCCT TCGTCACCAA CGACGAAGCC CTGGTCAAGA ACCTGGGGCT GGACTACCAG GTGGTGTACG CGGGCAGCGA GCCGGCCCTG ATCCAGGCGT TCCGTCAGGC GGAGCAGGAG AAGAAGCCGG TGCTCGGCTA CTTCTACGAC CCGCAGTGGT TCCTCTCCGA GATCGAACTG GTCAAGGTGA ACCTGCCCGA GTACGAGGAG GGCTGCGACG CCGACCCGGA GAAGGTCGCC TGCGACTACC CGGTGTACGA CCTTGACAAG ATCGTGAGTA AGTCGTTCGC CGACGCCAAC GGGCCCGCCT ACCAGCTGGT CGACAACTTC AACTGGAGCA ACGAGGACCA GAACGTGGTG GCCCGGTACA TCGCCCAGGA CAACATGTCG CCGGAGGAGG CGGCTGAGAA GTGGGTCGAG GCCAACAAGG ACAAGGTTGA GGCCTGGCTG CCGCAGAGCT GA
|
Protein sequence | MRSIVNKRVL AGVSLSTVAA LALTACGGTK IESTDPAEAG DCGTFTIAIN PWVGYEANAA VIAHVAETEL GCKVVKKDLK EEIAWQGFGT GQVDAIVENW GHDDLKKKYI EDQKTAVNAG STGVEGVIGW YVPPWMAEEY PDITDWNNLN KYASLFETTE SGGKGQLLDG DPSFVTNDEA LVKNLGLDYQ VVYAGSEPAL IQAFRQAEQE KKPVLGYFYD PQWFLSEIEL VKVNLPEYEE GCDADPEKVA CDYPVYDLDK IVSKSFADAN GPAYQLVDNF NWSNEDQNVV ARYIAQDNMS PEEAAEKWVE ANKDKVEAWL PQS
|
| |