Gene Sare_1628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1628 
Symbol 
ID5703472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1865799 
End bp1866773 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content64% 
IMG OID641271136 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001536511 
Protein GI159037258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.634582 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATCCA TTGTTAACAG GAGGGCCCGT CTGGCGGGCA TCTCACTGTC CACGGTGGCG 
GCACTCGCGC TCACCGCGTG CGGTGGGGCC AAGGTCGAGT CGTCGGATGC CGCCGATTCG
GGCGACTGCG GTACGTTCAC CATCGCGATC AACCCCTGGG TCGGGTACGA GGCGAACGCG
GCCGTCATCG CCCACGTCGC CGAGACCGAA CTCGGCTGCA CGGTCGTCAA GAAGGACCTC
AAGGAGGAGA TCGCCTGGCA GGGCTTCGGC ACCGGTCAGG TGGACGCGAT CGTGGAGAAC
TGGGGCCACG ACGACCTCAA GAAGAAGTAC ATCGAGGACC AGAAGACCGC GGTCGACGCC
GGTTCGACCG GCGTGGAGGG AGTCATCGGC TGGTACGTGC CGCCATGGAT GGCCGAGGAG
TACCCCGACA TCACCGACTG GAACAACCTG AACAAGTACG CGTCCCTGTT CGAAACCACG
GAGTCCGGTG GCAAGGGGCA GCTGCTCGAC GGCGACCCGT CCTTCGTCAC CAACGACGAA
GCCCTGGTCA AGAACCTGGA GCTGGACTAC AAGGTGGTCT ACGCGGGCAG TGAGCCGGCG
TTGATCCAGG CGTTCCGCCA GGCGGAGAAG GAGAAGAAGC CGGTGCTCGG CTACTTCTAC
GACCCGCAGT GGTTCCTCTC CGAGGTCGAA CTGGTGAAGG TGAACCTGCC CGAGTACCAG
GAGGGCTGCG ACGCCGACCC GGAGAAGGTC GCCTGCGACT ACCCGGTGTA CGACCTCGAC
AAGATCGTCA GCAAGTCGTT CGCCGACGCC AACGGGCCGG CGTACCAGCT GGTCAACAAC
TTCACCTGGA CCAACGAGGA CCAGAACCTG GTGGCCCGGT ACATCGCCCA GGACAACATG
TCGCCGGAAG AGGCGGCCAA GAAGTGGGTC GAGGCCAACA AGGACAAGGT CGAGGCCTGG
CTGCCGCAGA GCTGA
 
Protein sequence
MRSIVNRRAR LAGISLSTVA ALALTACGGA KVESSDAADS GDCGTFTIAI NPWVGYEANA 
AVIAHVAETE LGCTVVKKDL KEEIAWQGFG TGQVDAIVEN WGHDDLKKKY IEDQKTAVDA
GSTGVEGVIG WYVPPWMAEE YPDITDWNNL NKYASLFETT ESGGKGQLLD GDPSFVTNDE
ALVKNLELDY KVVYAGSEPA LIQAFRQAEK EKKPVLGYFY DPQWFLSEVE LVKVNLPEYQ
EGCDADPEKV ACDYPVYDLD KIVSKSFADA NGPAYQLVNN FTWTNEDQNL VARYIAQDNM
SPEEAAKKWV EANKDKVEAW LPQS