Gene Information |
Locus tag | Sare_3486 |
Symbol | |
ID | 5703549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4020923 |
End bp | 4022254 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272913 |
Product | glycine betaine/L-proline ABC transporter, ATPase subunit |
Protein accession | YP_001538279 |
Protein GI | 159039026 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
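The coordinate and length fields in this record agree with each other under two common conventions, both assumed here rather than stated by the record: start and end positions are 1-based and inclusive, and the protein length excludes the terminal stop codon. A minimal sketch of that check follows; the helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino acid count of an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record for Sare_3486
length_bp = gene_length(4020923, 4022254)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's values, the computed lengths can be compared against the "Gene Length" and "Protein Length" fields as a quick sanity check on the annotation.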
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0571528 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGAGACCG TGTCCGCACT CAGCGTTAAA TCTCTGTACA AGGTTTACGG GAACCGCCCC GACAAGGTGG TCGACCTTCT CCGGGACGGT ACCCACCAGG ACGAGAAGCT GGCCGGCATC TCGGCCACAG CGGCCGTCGT CGACGCGAAC TTCGAGGTCA AGCGTGGCGA GATCTTCGTT GTCATGGGCC TGTCCGGATC GGGGAAGTCA ACCCTCATCC GGATGCTCAA CGGACTCCTG AAGCCCACGC ACGGCACCGT CGAGGTGGAC GGGGTCGACC TCACGAAACT CAAACCCACG GCGCTGCGAA AGCTACGCCG CGAGAAGATC AGCATGGTCT TCCAGCACTT CGCGCTGCTA CCGCACCGGT CCGTCCTGGC CAACGCCGGT TACGCGCTGG AGGTGGCGGG CCTGCCCCGG GATGAGCGCC GCGAGCGGGC CCTCCAGGCA CTGCGGATGG TCGGCCTGGA CGCCTGGGCG GACAAGCTGC CGCAGGAACT GTCCGGCGGC ATGCGCCAGC GTGTCGGCCT GGCCCGCGCA CTCGCCGCCG GCACCGACAT CATGCTCATG GACGAGGCGT TCTCCGCCCT CGACCCGCTC ATCCGCCGCG AGGTCCAGGA CCAGCTGCTC GAGCTACAGG CCGAACTCGG CAAGACGATC GTGTTCATCA CCCACGACCT CAACGAGGCG ATGCGCCTCG GAGACCGGAT CGCGGTCATG CGGGACGGCC GGATCGTCCA GATCGGTACC GCCGAGGAAA TCCTCACCGC TCCGGCCAAC GACTACGTCG CCCAGTTCGT CGCCGACGTC GACCGGACCC GGATTCTCAC CGCGTCATCG GTCATGGAAC GGCCATCCGG CGTCGTCGAC GTGAACGCCG GTCCGCAGGT GGCCACCCGC GTGCTGCGTG AGACCCAGGC CTCAGCCATC TACGTCGCCG GGCCGGACGA CAGGTACCTC GGTACGGTGA GCGCCGACGC CGCCACCACT GCCGCCCGCG ACGGCGGGAA GAACCTTCAG GGGATCGTCT CGACCGAGGA TGTCCAGACC GTCGCCGAAG ACACTCCCGT CGCCGAGCTC TTCGCGCCCT GTGCGACCAG CCGACATCCG GTCGCCGTCG TCGACGACAC CGGGCGACTG ACCGGGGTGG TCTCACAGGT GGCACTCCTC AACGCCCTGG GCAACCTCGG CGTCGAGAAC GGCGACCGGG GCCAGTCGCC CCCGGCGGTG GGCACGGAGC CCGCGTCCGC CGGAGCGGGC GCCGCCACCT CGGCGGTAGC GGCGCAGCGG GTGGCCGCAC AACCGGTCCG GCTGAAGGAG GATCCCGCAT GA
|
Protein sequence | METVSALSVK SLYKVYGNRP DKVVDLLRDG THQDEKLAGI SATAAVVDAN FEVKRGEIFV VMGLSGSGKS TLIRMLNGLL KPTHGTVEVD GVDLTKLKPT ALRKLRREKI SMVFQHFALL PHRSVLANAG YALEVAGLPR DERRERALQA LRMVGLDAWA DKLPQELSGG MRQRVGLARA LAAGTDIMLM DEAFSALDPL IRREVQDQLL ELQAELGKTI VFITHDLNEA MRLGDRIAVM RDGRIVQIGT AEEILTAPAN DYVAQFVADV DRTRILTASS VMERPSGVVD VNAGPQVATR VLRETQASAI YVAGPDDRYL GTVSADAATT AARDGGKNLQ GIVSTEDVQT VAEDTPVAEL FAPCATSRHP VAVVDDTGRL TGVVSQVALL NALGNLGVEN GDRGQSPPAV GTEPASAGAG AATSAVAAQR VAAQPVRLKE DPA