Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_1620 |
Symbol | |
ID | 5703401 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 1854415 |
End bp | 1855398 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641271128 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001536503 |
Protein GI | 159037250 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.676362 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAACC GAACTGCACT CACCCGCCTG GTGGCCGGCT CCACGCTGGT CGCCCTCGCG CTCAGCGGCT GCTCGGTGAC CACCGAGGAG TCCGGCGCCG ACGTATCAGT CGGCAAGGGG TCGATCAAGG AAGACTCCTC CCTCAAGGGA CAGACCATCG TGGTCGGTTC CAAGGACTTC ACCGAGAACA TCGTCTTCGG GCACATCACG ATGCAGGCAC TGACCGCCGC CGGCGCGGAG GTCGAGGACA AGACCAACAT CAAGGGCTCG GTCAACGTCC GCAAGGCGCT CCTCAGCGGT GACGTCGACG TCTACTGGGA CTACACCGGC ACGGCGTGGA TCACCTACCT CGACCACGCC GACCCGATTC AGAACTCCGC CGAGCAGTAC GCGGCCGTGG TCAAGGAAGA CAAGGAAAAG AACAACGTGG TCTGGGGGGC CTTTGCCCCC GCGAACAACA CCTACGCCCT CGCGGTACGC GAGGAGAAGG CCGAGGAGTG GAACCTCAAC ACGCTGTCCG ACCTGGCCGC GTTCGCCAAG AGTAACCCGG AGGACGCGAC GTTCTGCCTG GAGAGCGAGT TCGTCGGGCG CAACGACGGC TGGCCCGGGA TGACCAAGGC GTACGGGATG AACGTCCCGG CGGACAGCGT CAAGGTGGTC GACACCGGCG TCGTCTACAC CGAAACGAAG AAGGGCGAGG CCTGCAACTT CGGCGAGGTG TTCACCACCG ATGGACGGAT CAGTCACCTG AACCTACGGG TGCTGGAGGA TGACCAGAGC TTCTTCCCGA TCTACAACCC GGCCCCCACG CTCAATGGGG ACACGGCCGC TTCGTACGGC AGCATCCTGA CGATTCTCGA GCCGATCGTG GCCAAGCTCG ACGACGATAC CCTCCGGCAG CTCAACGAGA AGGTGGATGT CGACGGTGAG CCGGTGGCGC AGGTCGTCTC CGAGTGGCTG AAGGCCGAGG GCTTCATCGG CTGA
|
Protein sequence | MRNRTALTRL VAGSTLVALA LSGCSVTTEE SGADVSVGKG SIKEDSSLKG QTIVVGSKDF TENIVFGHIT MQALTAAGAE VEDKTNIKGS VNVRKALLSG DVDVYWDYTG TAWITYLDHA DPIQNSAEQY AAVVKEDKEK NNVVWGAFAP ANNTYALAVR EEKAEEWNLN TLSDLAAFAK SNPEDATFCL ESEFVGRNDG WPGMTKAYGM NVPADSVKVV DTGVVYTETK KGEACNFGEV FTTDGRISHL NLRVLEDDQS FFPIYNPAPT LNGDTAASYG SILTILEPIV AKLDDDTLRQ LNEKVDVDGE PVAQVVSEWL KAEGFIG
|
| |