Gene Sare_1620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1620 
Symbol 
ID5703401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1854415 
End bp1855398 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content64% 
IMG OID641271128 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001536503 
Protein GI159037250 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.676362 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAACC GAACTGCACT CACCCGCCTG GTGGCCGGCT CCACGCTGGT CGCCCTCGCG 
CTCAGCGGCT GCTCGGTGAC CACCGAGGAG TCCGGCGCCG ACGTATCAGT CGGCAAGGGG
TCGATCAAGG AAGACTCCTC CCTCAAGGGA CAGACCATCG TGGTCGGTTC CAAGGACTTC
ACCGAGAACA TCGTCTTCGG GCACATCACG ATGCAGGCAC TGACCGCCGC CGGCGCGGAG
GTCGAGGACA AGACCAACAT CAAGGGCTCG GTCAACGTCC GCAAGGCGCT CCTCAGCGGT
GACGTCGACG TCTACTGGGA CTACACCGGC ACGGCGTGGA TCACCTACCT CGACCACGCC
GACCCGATTC AGAACTCCGC CGAGCAGTAC GCGGCCGTGG TCAAGGAAGA CAAGGAAAAG
AACAACGTGG TCTGGGGGGC CTTTGCCCCC GCGAACAACA CCTACGCCCT CGCGGTACGC
GAGGAGAAGG CCGAGGAGTG GAACCTCAAC ACGCTGTCCG ACCTGGCCGC GTTCGCCAAG
AGTAACCCGG AGGACGCGAC GTTCTGCCTG GAGAGCGAGT TCGTCGGGCG CAACGACGGC
TGGCCCGGGA TGACCAAGGC GTACGGGATG AACGTCCCGG CGGACAGCGT CAAGGTGGTC
GACACCGGCG TCGTCTACAC CGAAACGAAG AAGGGCGAGG CCTGCAACTT CGGCGAGGTG
TTCACCACCG ATGGACGGAT CAGTCACCTG AACCTACGGG TGCTGGAGGA TGACCAGAGC
TTCTTCCCGA TCTACAACCC GGCCCCCACG CTCAATGGGG ACACGGCCGC TTCGTACGGC
AGCATCCTGA CGATTCTCGA GCCGATCGTG GCCAAGCTCG ACGACGATAC CCTCCGGCAG
CTCAACGAGA AGGTGGATGT CGACGGTGAG CCGGTGGCGC AGGTCGTCTC CGAGTGGCTG
AAGGCCGAGG GCTTCATCGG CTGA
 
Protein sequence
MRNRTALTRL VAGSTLVALA LSGCSVTTEE SGADVSVGKG SIKEDSSLKG QTIVVGSKDF 
TENIVFGHIT MQALTAAGAE VEDKTNIKGS VNVRKALLSG DVDVYWDYTG TAWITYLDHA
DPIQNSAEQY AAVVKEDKEK NNVVWGAFAP ANNTYALAVR EEKAEEWNLN TLSDLAAFAK
SNPEDATFCL ESEFVGRNDG WPGMTKAYGM NVPADSVKVV DTGVVYTETK KGEACNFGEV
FTTDGRISHL NLRVLEDDQS FFPIYNPAPT LNGDTAASYG SILTILEPIV AKLDDDTLRQ
LNEKVDVDGE PVAQVVSEWL KAEGFIG