Gene Sare_3967 details


Gene Information

Locus tag: Sare_3967
Symbol:
ID: 5705244
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4505009
End bp: 4506355
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 66%
IMG OID: 641273392
Product: extracellular solute-binding protein
Protein accession: YP_001538748
Protein GI: 159039495
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
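
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Gene Information fields of this record.
start_bp, end_bp = 4505009, 4506355
length_bp = gene_length(start_bp, end_bp)

print(length_bp)                           # 1347
print(expected_protein_length(length_bp))  # 448
```

Under these assumptions the computed values agree with the listed Gene Length (1347 bp) and Protein Length (448 aa).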


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0182057
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.154584
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGTCT TCGCCAGACC ACGCCAAGCC CTCGTGATCG CTGGCGCGCT CGGCCTGGCC 
ATCAGTGCCA CCGCCTGCGG TACCGGCGAC AACGACGGCA GCGGTAAGGC CGATTCCCCG
GAATGCGCGG CATACCAGAA GTATCAGGGC CACGGCGGCG CCGAGGTCTC CATCTACGCG
TCCATTCGTG ACGCGGAGGC AGACCTGCTC GAACAGTCGT GGGAGCAGTT CGCAGAATGC
ACCGGCATCG AGATCGACTA CGAGGGCAGC GGCGAGTTCG AGGCGCAGCT CCAGGTGCGG
GTCGACGGCG GCAACGCACC GGACATCGCC TTCGTTCCGC AGCCGGGCCT GCTGAGCCGA
TTCGCGCAGG CCGGCAAGCT CAAGCCGGCA TCGGCCGAGA CCAAGGCGAT GGCCGAGGAA
AACTACGCCG CCGACTGGCT GAAATACAGC ACCGTCGCGG GACAGTTCTA CGGCGCTCCG
CTGGGGTCGA ACGTCAAGTC GTTCGTCTGG TACTCACCAA AGATGTTCCA GGAGCAGGGT
TGGACCGTCC CGACCACCTG GGACGACCTG ATCAAACTCA GCGACTCGGC CGCGGCTGGC
GGCATCAAGC CATGGTGTGT CGGCATCGAG TCCGGTGACG CCACCGGCTG GCCGGCCACC
GACTGGATCG AGGACGTGCT GCTGCGGACG CAGACCCCCG AGGTCTACGA CCAGTGGACC
ACGCACGGCA TACCGTTCAA CGACCAGCGT GTGGTGGACG CGGTCGACCG TGCCGGCACC
ATCCTGCGAA ACGAGAAGTA CGTCAACGGC GGCTACGGCG GCGTGAAGAG CATCGCCACC
ACGTCGTTCC AGGAGGGCGG TCTGCCGATC CTCCAGGGTG AGTGCGCCCT GCACCGGCAG
GCGTCCTTCT ACGCCAACCA GTGGCCCGCG GACAGCCGGG TGGCCGAGGA CGGCGACGTC
TTCGCGTTCT ACTTCCCGGC CATCGACCCG TCGAAGGGCA AGCCGGTGTT GGGAGGCGGC
GAGTTCACCG TCGCTTTCGA CGACCGCCCC GAGGTCCAGG CGGTACAGAC GTACCTCGCC
TCCGGCGAGT ACGCCAACAG TCGGGCCAAG CTGGGCAACT GGGTGTCGGC GAACAGGAAG
CTCGACGTGG CCAACGTCGC GAACCCGATC GACAAGCTGT CGGTCGAGAT CCTTCAGGAC
GAGAGCACGG TCTTCCGCTT CGATGGTTCC GACCTGATGC CCGCCGCCGT CGGCGCCGGG
ACATTCTGGA AGGAGATGGT GTCCTGGATC AGCGGCAAGG ACACCAAGGC GGCCCTGGAC
GCCATCGAGA GTTCCTGGCC CCGCTGA
 
Protein sequence
MAVFARPRQA LVIAGALGLA ISATACGTGD NDGSGKADSP ECAAYQKYQG HGGAEVSIYA 
SIRDAEADLL EQSWEQFAEC TGIEIDYEGS GEFEAQLQVR VDGGNAPDIA FVPQPGLLSR
FAQAGKLKPA SAETKAMAEE NYAADWLKYS TVAGQFYGAP LGSNVKSFVW YSPKMFQEQG
WTVPTTWDDL IKLSDSAAAG GIKPWCVGIE SGDATGWPAT DWIEDVLLRT QTPEVYDQWT
THGIPFNDQR VVDAVDRAGT ILRNEKYVNG GYGGVKSIAT TSFQEGGLPI LQGECALHRQ
ASFYANQWPA DSRVAEDGDV FAFYFPAIDP SKGKPVLGGG EFTVAFDDRP EVQAVQTYLA
SGEYANSRAK LGNWVSANRK LDVANVANPI DKLSVEILQD ESTVFRFDGS DLMPAAVGAG
TFWKEMVSWI SGKDTKAALD AIESSWPR
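
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch using only the Python standard library; the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It uses the standard codon assignments, which translation table 11 shares for elongation codons (the two tables differ in permitted start codons). Only the first and last lines of the gene sequence are included here to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence; stop codons are shown as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence from this record.
first_line = "ATGGCGGTCT TCGCCAGACC ACGCCAAGCC CTCGTGATCG CTGGCGCGCT CGGCCTGGCC"
last_line = "GCCATCGAGA GTTCCTGGCC CCGCTGA"

print(translate(first_line))  # MAVFARPRQALVIAGALGLA
print(translate(last_line))   # AIESSWPR*
```

The two fragments translate to the first 20 and last 8 residues of the listed protein sequence, followed by the stop codon. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.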