Gene Sare_3486 details


Gene Information

Locus tag: Sare_3486
Symbol:
ID: 5703549
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4020923
End bp: 4022254
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 69%
IMG OID: 641272913
Product: glycine betaine/L-proline ABC transporter, ATPase subunit
Protein accession: YP_001538279
Protein GI: 159039026
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4175] ABC-type proline/glycine betaine transport system, ATPase component
TIGRFAM ID: [TIGR01186] glycine betaine/L-proline transport ATP binding subunit
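
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are both inclusive positions on the replicon and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Start bp and End bp fields of this record.
START_BP = 4020923
END_BP = 4022254


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumption)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1332 443"
```

Under these assumptions the result agrees with the Gene Length (1332 bp) and Protein Length (443 aa) fields.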


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0571528
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGACCG TGTCCGCACT CAGCGTTAAA TCTCTGTACA AGGTTTACGG GAACCGCCCC 
GACAAGGTGG TCGACCTTCT CCGGGACGGT ACCCACCAGG ACGAGAAGCT GGCCGGCATC
TCGGCCACAG CGGCCGTCGT CGACGCGAAC TTCGAGGTCA AGCGTGGCGA GATCTTCGTT
GTCATGGGCC TGTCCGGATC GGGGAAGTCA ACCCTCATCC GGATGCTCAA CGGACTCCTG
AAGCCCACGC ACGGCACCGT CGAGGTGGAC GGGGTCGACC TCACGAAACT CAAACCCACG
GCGCTGCGAA AGCTACGCCG CGAGAAGATC AGCATGGTCT TCCAGCACTT CGCGCTGCTA
CCGCACCGGT CCGTCCTGGC CAACGCCGGT TACGCGCTGG AGGTGGCGGG CCTGCCCCGG
GATGAGCGCC GCGAGCGGGC CCTCCAGGCA CTGCGGATGG TCGGCCTGGA CGCCTGGGCG
GACAAGCTGC CGCAGGAACT GTCCGGCGGC ATGCGCCAGC GTGTCGGCCT GGCCCGCGCA
CTCGCCGCCG GCACCGACAT CATGCTCATG GACGAGGCGT TCTCCGCCCT CGACCCGCTC
ATCCGCCGCG AGGTCCAGGA CCAGCTGCTC GAGCTACAGG CCGAACTCGG CAAGACGATC
GTGTTCATCA CCCACGACCT CAACGAGGCG ATGCGCCTCG GAGACCGGAT CGCGGTCATG
CGGGACGGCC GGATCGTCCA GATCGGTACC GCCGAGGAAA TCCTCACCGC TCCGGCCAAC
GACTACGTCG CCCAGTTCGT CGCCGACGTC GACCGGACCC GGATTCTCAC CGCGTCATCG
GTCATGGAAC GGCCATCCGG CGTCGTCGAC GTGAACGCCG GTCCGCAGGT GGCCACCCGC
GTGCTGCGTG AGACCCAGGC CTCAGCCATC TACGTCGCCG GGCCGGACGA CAGGTACCTC
GGTACGGTGA GCGCCGACGC CGCCACCACT GCCGCCCGCG ACGGCGGGAA GAACCTTCAG
GGGATCGTCT CGACCGAGGA TGTCCAGACC GTCGCCGAAG ACACTCCCGT CGCCGAGCTC
TTCGCGCCCT GTGCGACCAG CCGACATCCG GTCGCCGTCG TCGACGACAC CGGGCGACTG
ACCGGGGTGG TCTCACAGGT GGCACTCCTC AACGCCCTGG GCAACCTCGG CGTCGAGAAC
GGCGACCGGG GCCAGTCGCC CCCGGCGGTG GGCACGGAGC CCGCGTCCGC CGGAGCGGGC
GCCGCCACCT CGGCGGTAGC GGCGCAGCGG GTGGCCGCAC AACCGGTCCG GCTGAAGGAG
GATCCCGCAT GA
 
Protein sequence
METVSALSVK SLYKVYGNRP DKVVDLLRDG THQDEKLAGI SATAAVVDAN FEVKRGEIFV 
VMGLSGSGKS TLIRMLNGLL KPTHGTVEVD GVDLTKLKPT ALRKLRREKI SMVFQHFALL
PHRSVLANAG YALEVAGLPR DERRERALQA LRMVGLDAWA DKLPQELSGG MRQRVGLARA
LAAGTDIMLM DEAFSALDPL IRREVQDQLL ELQAELGKTI VFITHDLNEA MRLGDRIAVM
RDGRIVQIGT AEEILTAPAN DYVAQFVADV DRTRILTASS VMERPSGVVD VNAGPQVATR
VLRETQASAI YVAGPDDRYL GTVSADAATT AARDGGKNLQ GIVSTEDVQT VAEDTPVAEL
FAPCATSRHP VAVVDDTGRL TGVVSQVALL NALGNLGVEN GDRGQSPPAV GTEPASAGAG
AATSAVAAQR VAAQPVRLKE DPA
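
Both sequences are printed in blocks of ten characters, so the spacing has to be removed before any analysis. The sketch below is one possible way to do that in plain Python, and to translate the gene sequence and measure GC content. The helper names (`clean_sequence`, `translate`, `gc_fraction`) are hypothetical. The codon table is built from the standard amino acid assignments, which translation table 11 shares for internal codons; table 11 also permits alternative start codons, which this sketch does not handle (the gene here starts with ATG).

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and the final line of the gene sequence above.
print(translate("ATGGAGACCG TGTCCGCACT CAGCGTTAAA"))  # prints "METVSALSVK"
print(translate("GATCCCGCAT GA"))                     # prints "DPA"
```

The two fragments reproduce the first ten and the last three residues of the protein sequence. The final line can be translated on its own because the 1320 bases that precede it are a multiple of three, so it is already in frame.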