Gene Sare_1622 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1622 
Symbol 
ID5703403 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1856241 
End bp1857437 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content67% 
IMG OID641271130 
Productglycine betaine/L-proline ABC transporter, ATPase subunit 
Protein accessionYP_001536505 
Protein GI159037252 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1125] ABC-type proline/glycine betaine transport systems, ATPase components 
TIGRFAM ID[TIGR01186] glycine betaine/L-proline transport ATP binding subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCGCG ACGTCATGAT TGATCTTCAA CAGGTCAGCA AGTTCTACCG CGGGCAGAAG 
GCCCCCGTGG TGGAGAACAT GTCGATGACG ATCCACCGGG GTGAGATCGT CGTCCTGGTC
GGCCCCTCTG GCTGCGGCAA GACCACGACC ATGAAAATGA TCAACCGTTT GATCGAGCCC
AGCAGCGGCC GGATCCTCAT CGACGGAACC GACGTGACCG CACTGGACGG CAACGACCTA
CGCCGACAGA TCGGCTACGT CATTCAGCAG GTTGGGCTCT TCCCACACAT GAGCGTCGCC
ACCAATGTCG GATTGGTGCC GAAGATGCTC GGCTGGGACC GCAAGCGTAT CGAGGCCCGG
GTCGACGAAC TCCTGCACCT CGTCGGCCTC GAACCGGCCA CCTACCGCAA CCGGTTGCCC
CGCCAGCTCT CCGGCGGGCA GCAGCAGCGC GTCGGGGTGG CCCGGGCCCT CGCCGCGGAT
CCGCCGGTGA TGCTGATGGA CGAGCCGTTC GGCGCCACCG ACCCAATGAC CCGGGACAGG
CTGCAGAACG AGTTCCTCCG CCTCCAAGAT CAGTTGCGCA AGACGATCGT CTTCGTGACC
CACGACTTCG ACGAGGCGAT CAAGATGGGC ACCCGGATCG CCGTCCTCGG GGAGAGGTCC
AGGATTCGGC AGTTCGACAC CCCCGAAGTC CTGCTGGCGC ACCCCGCCGA CAGCACCGTG
GCCCAGTTCA TCGGCGGCGG CGCCCAGCTG AAACAGCTCG ACCTTCGCCG GGTCGACGCG
ATCCAGTGGG ACGACGTCCC CCTGATCCGG GTGGACAAGG CCGCCACCGG TACGCGACGC
CACCCCGACG GGGCCGACGG CTCCACGGCG CTCACCGTGG ACGACGACAA CCGCCCGCTG
GGCTGGATCA GCGCCCACGA TCGGGCGATC CGGCAGGGCG CTCCCGAGGG TGCCAGCCAG
GCGGTGACGA CGGTGGAGCC ACAGGCGACG TTGCGTGATG CCCTCGACGC GATGCTCGCC
TCACGCCACG GCACCGCCGT GGTGGTCGAC GAGCAGGGCC GCTACGCCGG CGCGGTGACA
CTCGACGGCC TGATGCGGGT GATTCACCCC ACGCGAGAGC AGGACCGGCT GGACTCCGCT
GTTCCCGGTC AGACAACGGG TGAGGTCCGC TCCGCGCTGA CGCCGGAGAA ACGATGA
 
Protein sequence
MSRDVMIDLQ QVSKFYRGQK APVVENMSMT IHRGEIVVLV GPSGCGKTTT MKMINRLIEP 
SSGRILIDGT DVTALDGNDL RRQIGYVIQQ VGLFPHMSVA TNVGLVPKML GWDRKRIEAR
VDELLHLVGL EPATYRNRLP RQLSGGQQQR VGVARALAAD PPVMLMDEPF GATDPMTRDR
LQNEFLRLQD QLRKTIVFVT HDFDEAIKMG TRIAVLGERS RIRQFDTPEV LLAHPADSTV
AQFIGGGAQL KQLDLRRVDA IQWDDVPLIR VDKAATGTRR HPDGADGSTA LTVDDDNRPL
GWISAHDRAI RQGAPEGASQ AVTTVEPQAT LRDALDAMLA SRHGTAVVVD EQGRYAGAVT
LDGLMRVIHP TREQDRLDSA VPGQTTGEVR SALTPEKR