Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4723 |
Symbol | |
ID | 5706025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5347259 |
End bp | 5348395 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641274121 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001539467 |
Protein GI | 159040214 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000278675 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGACGAAGT CCGGCCCCAG TTTGTCGTAC CCGCCTGGAG GATGTTGGGG TAGAAGACGC GGGCGACCCG ACCGCCCGTC GGACACGCGG CCGGCCCGAA AGGCCGTGCC GGGACACGGA AGGCGGGCAG ACATGCACGC GCGCACACGT TTGGCGATCG GTGCCGTCGG CGCCCTGGCC ACGGCTGGGC TGTTGACTGG CTGCGGCGAG GCCGGGTCGT CGGGCACGGA CGCACCCGAG CCGGCTGCCT CCGGGCAGGG ATGTGCCCCG GTGGCCGGCG ACGAGCTGAT CGTGCTGACC GACGACAAGT TGCTCCAGAA CAGCGACAAT GTCATCCCCG CGGTCAACGC CGACGTCGCG TCACCGCAGC TCGTCGCCGC CCTGGACCGG GTCTCCGCGG CACTGGACAC ACCGAAGCTG ATTCAGCTCA ACAAGGCTGT CGACGTGGAC CGCAAGACGC CGGCGGTCGC GGCGGCCGAG TTCGCCGCCG CCAACGACCT CACCACCGGC GTCGAGAAGG GCTCCGGCGG CACGATCACG GTTGGTGCCG CCAACTTCAG TGAGAGCCAG ACCCTCGCCG AGCTTTACAA GATCGTGCTG ATCGCCGCCG GCTATCAGGC CGAGGTGCAG CAGGTCGGCA GCCGCGAACT CTACGAGCCC GCCCTGGAGA AGGGCGAGAT CCAGGTGTTC CCTGAGTACG CGGCCACGAT GGCGGAGTTC CTCAACACCA AGGCGAACGG CAAAGACGCC CCGCCGGTCT CCTCGCCGGA TCTGGACGAG ACGGTTGCCG CGCTGAAGGC GGCCGGTGCG AAGGTCAACC TGGCCTTCGG TTCGCCCTCC GCGGCCCAGG ACCAGAATGC GTTTGCCGTT ACGCGTGCCT TCGCCGACAA GTACGACGTG CGTACCCTCT CCGAACTGGC CGAGAAGTGC TCCGGTCAGG AGACCGTCCT GGCCGGTCCC CCGGAGTGCC CGCAGCGGCC GAAGTGCCAG GTCGGGCTGG TCGAGGTCTA CGACTTCAAG GCCGGGTCGT TCAGCTCGCT CGACGCTGCC GGGCCGCAGA CGAAGAATGC CCTGAAGACC GGTGTGGCCA GCGTGGGTCT GGTGCTCTCC TCCGACGGGG CGCTCGCCGC CGGCTGA
|
Protein sequence | MTKSGPSLSY PPGGCWGRRR GRPDRPSDTR PARKAVPGHG RRADMHARTR LAIGAVGALA TAGLLTGCGE AGSSGTDAPE PAASGQGCAP VAGDELIVLT DDKLLQNSDN VIPAVNADVA SPQLVAALDR VSAALDTPKL IQLNKAVDVD RKTPAVAAAE FAAANDLTTG VEKGSGGTIT VGAANFSESQ TLAELYKIVL IAAGYQAEVQ QVGSRELYEP ALEKGEIQVF PEYAATMAEF LNTKANGKDA PPVSSPDLDE TVAALKAAGA KVNLAFGSPS AAQDQNAFAV TRAFADKYDV RTLSELAEKC SGQETVLAGP PECPQRPKCQ VGLVEVYDFK AGSFSSLDAA GPQTKNALKT GVASVGLVLS SDGALAAG
|
| |