Gene Sare_2145 details

Gene Information

Locus tag: Sare_2145
Symbol:
ID: 5706963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2467385
End bp: 2469190
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 67%
IMG OID: 641271630
Product: extracellular solute-binding protein
Protein accession: YP_001537001
Protein GI: 159037748
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
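
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the variable names are illustrative, not part of the record):

```python
# Values copied from the record above.
start_bp = 2467385
end_bp = 2469190

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1806 bp) and Protein Length (601 aa).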


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.377347
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0104158
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCGT CCCTACCGGT CCTCACCGGC GACGACACAC GCCCGCACCG GGGCGGCACG 
GTGACCTGGG CGTGCGCGCC CGGTTTCCCA CCCGCCGTGA TCTTCCCCTT CACGCCCGCC
GAACGGATGG GCACCCGCAA CATCTACGAG TTCCAGATGC TGATGTACCG CCCGCTGTAC
TACTTCGGCA GCAAGGGCAC CCCAGAGGTC GACTACGAGC AGAGCATCGG CGAGCCACCG
CGGTGGAGCG ACGACGGGCT CACCGTCCGA ATCCGGATCA AGCCGTGGAA GTGGTCCAAC
GGCGAGACCC TCTGCGCGGA CAACGTGCTG TTCTGGGTGA ACCTGATGAA GGTCAAGGGC
GACCGTTACG GCGAGTACGT CCCGGGCTAC TTCCCGGACA ACTGCACCGA CTACGGCAGG
GATGGTGAGG ACAGCGTCTG GTTCACCTTC GACAAGCCGT ACTCCCGCAA GTGGGTCCTG
ATGAACCAGC TCAGCACCAT CACCCCGCTA CCCCGGGCGT GGGACCGGAC CGCCGACGGA
CCGGCAGACG CCTCCCGCGG CCTGGCCGAC GTCGCCGCGG TCTACGACTA CCTCATGGCC
GAGCAGGGCG ACATCGTCGC GGAAAGCAAC AGGCATCGCA CTCGCTGGGC CGACAGCCCT
GTCTGGAGCG TGGTCGACGG CCCCTGGCGG CTGAAAAGCT ACACCCTGGA GGGAGTCGTC
ACCTTCGTGC CCAACCAGCA CTACTCCGGG CCGAATCGGC CCTATGTGGA CGAGTTCCGA
CAGGTGCCCA CGATGTCCGA CGACGAGGAG TACCGCATGC TCCAGGCCGG GCCGCGAGGC
CCGGACTCCG TTCAGGTCGG GTACCTGCCG CTGAGCTTCA CCACCGAGCC GACCGACGAT
CCCACCCGAG GTGGGGCCAA CCCGCTCGCC CCGGACTATC GGCTGGTGCC TCAGGTCGCG
TTCTGCATCC GGTACTTCTG CCTGAACTAC AACAACCCGA CCGTCGCCGG GCGGATTTTC
ACCCAAACGT ACTTCCGGCA GGCGCTACAG TGCACCCTGG ACCAGGACGC GGCGGTCCGC
GACATCTACC ACGGATACGC GTACCGGCAG AACGGTCCGG TGCCGATGGT GCCGGCGACC
GACCTGGTCT CCCCACGGCA GCGTGCCGGC GCCTGGCCGT TGCCGTTCGA CCCGGACCGG
GCCCGCCGGC TACTACAGGA CAACGGCTGG GACACCAGCA CCACACCGGC GGTGTGCGTC
CGGCCCGGTA CCGGTCCCGG CGCGGCCGGC GCGGGTATCC CCGCCGGCAC CCGGCTCAGC
GTCCTGCTCC GGTACGTGGA AGGTCGGCCA GCGCTGACCC GGCTGATGAC CACCTTCCAA
CGCGACGCCG CCACCGCCGG GATCGAGATC CGGTTGGAGG AGGTGTACGG CTCCGTCCTG
GTGGCTGAGG ACGCACCCTG CGTGCCAAGT CCGGACACCC CCTGCCGGTG GGAGATGTGC
TGTTGGAACG GCGGCTGGGC CTACCACCAT CCGACCGGCG AGATCCTCTT CCGCACCGAC
GCCGGCGGCA ACTTCGGCCA CTGGAGTGAC CCTGTTACCG ACGAACTCAT CGAACGCACC
GTCACCAGCG ACGACCCGGC CGTCCTGTAC GAGTACCAGG ACCACATCGC CGAACAGGTC
CCGGTCATCT TCACCCCCAA CTTCCCGATC CGCCTCTTCG AGGTCTCCAG TGACCTGCGG
GGATTCGAAC CGGTCAACCC GTTCGGCATG ATCAACCCGG AGAACTGGTA CTACGTCGAC
CACTAG
 
Protein sequence
MTSSLPVLTG DDTRPHRGGT VTWACAPGFP PAVIFPFTPA ERMGTRNIYE FQMLMYRPLY 
YFGSKGTPEV DYEQSIGEPP RWSDDGLTVR IRIKPWKWSN GETLCADNVL FWVNLMKVKG
DRYGEYVPGY FPDNCTDYGR DGEDSVWFTF DKPYSRKWVL MNQLSTITPL PRAWDRTADG
PADASRGLAD VAAVYDYLMA EQGDIVAESN RHRTRWADSP VWSVVDGPWR LKSYTLEGVV
TFVPNQHYSG PNRPYVDEFR QVPTMSDDEE YRMLQAGPRG PDSVQVGYLP LSFTTEPTDD
PTRGGANPLA PDYRLVPQVA FCIRYFCLNY NNPTVAGRIF TQTYFRQALQ CTLDQDAAVR
DIYHGYAYRQ NGPVPMVPAT DLVSPRQRAG AWPLPFDPDR ARRLLQDNGW DTSTTPAVCV
RPGTGPGAAG AGIPAGTRLS VLLRYVEGRP ALTRLMTTFQ RDAATAGIEI RLEEVYGSVL
VAEDAPCVPS PDTPCRWEMC CWNGGWAYHH PTGEILFRTD AGGNFGHWSD PVTDELIERT
VTSDDPAVLY EYQDHIAEQV PVIFTPNFPI RLFEVSSDLR GFEPVNPFGM INPENWYYVD
H
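
The protein sequence is the codon-by-codon translation of the gene sequence. The sketch below illustrates this for the first and last rows of the gene sequence only, using the standard amino-acid assignments that NCBI translation tables 1 and 11 share. The helper `translate` and the variable names are hypothetical, written for this illustration. Table 11 differs from table 1 mainly in its set of permitted start codons, which does not matter here because this gene starts with ATG.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for the first, second, and third codon base.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [x + y + z for x in BASES for y in BASES for z in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Hypothetical helper: whitespace is ignored so that rows can be pasted
    in the 10-base grouped layout used above.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence (bases 1-60).
first_row = "ATGACCTCGT CCCTACCGGT CCTCACCGGC GACGACACAC GCCCGCACCG GGGCGGCACG"
# Last two rows of the gene sequence (bases 1741-1806), which start in frame.
last_rows = "GGATTCGAAC CGGTCAACCC GTTCGGCATG ATCAACCCGG AGAACTGGTA CTACGTCGAC CACTAG"

print(translate(first_row))  # MTSSLPVLTGDDTRPHRGGT
print(translate(last_rows))  # GFEPVNPFGMINPENWYYVDH
```

The two outputs match the first 20 residues and the last 21 residues of the protein sequence above. The same function could be applied to the full gene sequence to check the whole translation.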