Gene Sare_1626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1626 
Symbol 
ID5703470 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1862643 
End bp1863731 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content70% 
IMG OID641271134 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_001536509 
Protein GI159037256 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4175] ABC-type proline/glycine betaine transport system, ATPase component 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCATCA TGAGTACGGA TACGGCGGTT CAGCCGCGCG GGCAGGTGGA CCATCGCCAG 
ACCCCGGTGG TCTCGGTCCG GAACCTGTGG AAGGTGTTCG GCCCGAACGC GGACCAGGTG
CCGAAGTCGG CGGAACTCGG GGCGCTGTCC CGGCGTGAGC TGCTGGAGCG GGCCCGGTGT
ACCGCCGCGG TGCGTGAGGT GTCGTTCGAC GTCGCGCCCG GTGAGGTCTT CGTCGTGATG
GGGTTGTCCG GTTCCGGCAA GTCCACGCTG GTGCGCTGCC TGACCCGGCT GATCGAGCCG
ACCGCGGGCG AGGTCGTGTT CGAGGGCGAG GACATCCTGC GCGCCGACAA GCGGCGGCTG
CGGGAGCTAC GCCGGCACAA GTTCTCGATG GTCTTCCAGC ATTTCGGTCT CCTGCCGTAC
CGGACCGTTG TCGACAACGT CGGATACGGC CTGGAGATCC GTGGCGCCGG CCGCGCCGAG
CGTACCCGGC GGGCGACCGA GGTCATCGAG CTGGTGGGCC TCGACGGCTA CGAGCACGCG
TACCCGGACC AGCTCTCCGG CGGGATGCAG CAGCGCGTCG GACTGGCCCG GGCGCTGGCC
GGTGACCCGG ATGTGCTCTT CTTCGACGAG CCGTTCTCCG CGTTGGACCC GCTGATCCGC
CGGGACATGC AGAACGAGGT CATCCGACTG CACCGTCAGG TGGGCAAGAC GATGGTCTTC
ATCACGCACG ACCTCTCCGA GGCGCTCAAA CTCGGCGACC GGATTCTGCT CATGCGGGAC
GGCAACGTGG TGCAGTCCGG GACCGGGGAC GAGTTGGTCG GGGCACCGGC CGACGACTAC
GTGCGCGACT TCGTCCAGGA GGTGCCCCGC GCCGACGTCC TCACCCTGCG GTGGATCATG
CGTCCACGAC GGGACACCGA CCAGTTGGAC GGCCCCGAGC TGGGGCCGGG CGTCATCGTG
CGGGACGCGG TCCGCACGGT GCTCGCCGCC GACCGGCCGG TGCGGGTCGT CGAGAACGGG
GAACTGCTGG GCGTGGTCGG CGCCGAGGAG GTCCTCGACA TCGTCGCCGG TACGCAGGCG
GGCGCCTGA
 
Protein sequence
MGIMSTDTAV QPRGQVDHRQ TPVVSVRNLW KVFGPNADQV PKSAELGALS RRELLERARC 
TAAVREVSFD VAPGEVFVVM GLSGSGKSTL VRCLTRLIEP TAGEVVFEGE DILRADKRRL
RELRRHKFSM VFQHFGLLPY RTVVDNVGYG LEIRGAGRAE RTRRATEVIE LVGLDGYEHA
YPDQLSGGMQ QRVGLARALA GDPDVLFFDE PFSALDPLIR RDMQNEVIRL HRQVGKTMVF
ITHDLSEALK LGDRILLMRD GNVVQSGTGD ELVGAPADDY VRDFVQEVPR ADVLTLRWIM
RPRRDTDQLD GPELGPGVIV RDAVRTVLAA DRPVRVVENG ELLGVVGAEE VLDIVAGTQA
GA