Gene Sare_4381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4381 
Symbol 
ID5705072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4952503 
End bp4954134 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content63% 
IMG OID641273803 
Productextracellular solute-binding protein 
Protein accessionYP_001539153 
Protein GI159039900 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4166] ABC-type oligopeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000255213 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTGGGA AACTCCTGAA GGTAACCGTC GCGGCGACGG CCACTGCCAT GTTGGTCACT 
GCTTGCACCG GCGGTAACAG CGACGACCCA GCGGAAACGA ACGGTCAATC CGGTGGCGAG
CTACGCGTTT ACGCTTCGGA GCCGGCGTCC CTGGTACCGT CGGCCGCCAA TGATGAACCG
TCGATCTATG TGATCCGTCA GCTCTACCGT GGCCTGATCA AGTACAACGC CCAAACCGGT
GCGGCCGAAC ACGACCTGGC CGAGTCGATC GCGTCGGACG ACCACAAGCT TTGGACCATC
ACAATCAACG GTGGCTACAC CTTCGACAAC GGTGAGCCGG TCGACGCCGA GTCCTTCATC
CGGTCATGGA ACTACGCCGC TTACGGGCCG AACGCCCAGA GCAACGCCTA CTTCATGAAG
CGAATCGCCG GGTTCGACGA TGTTACGGCC AAGGATCCGG ACGGTGATGG TCCGAAGACG
GCCCCGGAGC CGAAGGCCAA GACGCTGTCG GGCCTGAAGA AGGTCGACGA CCTGACCTTC
ACCGTCGAGC TCAAGGAGCC GTTCACTGAC TTCCGGACCA CGCTCGGCTA CTCAGGCTTC
TTCCCGATGG CCCAGGCGTG CGTCGACGAT GAGGACGCGT GCAACGAGAC CCCGATCGGG
AACGGCCCTT ACAAGATCGA AGGCGCCTGG GAGCGTGGCG TCCAGATCAA CCTGACCCGT
AGTGACTCCT GGAAGGGCGA GCCGGGCAAG CCCGACAAGA TCAACTACCG GATCTTCGCG
GACGAAGGCT CTGCGTACTC CGCCTTCCAG GCCGGCGAGC TGGACGTGAT GTACACCCTG
CCGTCGGAGC GTTTCAAGGA CGCCAAGGCA AGCTACGGCG ACCGTCTGTA CGAGCAGCCA
GGCGACAGCC TGAACTACAT CGGCATGCCG CTGTACAACG AGAACTTCAC GGACAAGCGG
GTCCGTCAGG CGATCTCGCT GGCGATCGAC CGGCAGTCGA TCATCGACGC GGTGTTCGAC
GGCCGGTACA CCCCGGCTAC CGGCTTCATC GCGCCGACCT TCCAAAACGT CCGCGAGGGC
GTCTGCACGT ACTGCAGGAA GGACGTCGAG AAGGCCCAGG CGCTGCTCGC CGAGGCCGGT
GGCTGGAAGG GCGGCAAGCT GGTCCTGTGG GCCAACGCCG GTTCCAGTCA CGAAGCCTGG
CTGCAGGCAG CCGGCGACCA GATCAGGGCC GCCCTGGGCA TTGACTACGA GCTGAGGGTC
AACCTGCAGT TCCCCGAGTA CCTGGAGGCT GCCGACAACC GGAAGTTCAC CGGCCCGTTC
CGGCTCGGTT GGGGCCCGGA CTATCCGTCA CTGGAGACCT ACCTGGCTCC CCTGTATGGG
ACCGGGGCCG ACAGCAACAG TTCCACCTTC AGCAACCCCG AGTTCGACCG CCTGATGAAG
CAGGGTGACT CCGCCAGTTC CATCGAGGAG GCGGTCACCT TCTACCAGAA GGGTGAGGAT
ATCCTGGCGG AGGAGCTGCC GGTTATCCCG ATGTTCTGGC GCAAGGTGGG GGCGCTCTAC
AGCGAGAACG TCGACAACTT CGTCTGGAAC CAGTTCTCGG GCGCCGACTA TGGTGCGACC
TCGCTGAAGT AG
 
Protein sequence
MRGKLLKVTV AATATAMLVT ACTGGNSDDP AETNGQSGGE LRVYASEPAS LVPSAANDEP 
SIYVIRQLYR GLIKYNAQTG AAEHDLAESI ASDDHKLWTI TINGGYTFDN GEPVDAESFI
RSWNYAAYGP NAQSNAYFMK RIAGFDDVTA KDPDGDGPKT APEPKAKTLS GLKKVDDLTF
TVELKEPFTD FRTTLGYSGF FPMAQACVDD EDACNETPIG NGPYKIEGAW ERGVQINLTR
SDSWKGEPGK PDKINYRIFA DEGSAYSAFQ AGELDVMYTL PSERFKDAKA SYGDRLYEQP
GDSLNYIGMP LYNENFTDKR VRQAISLAID RQSIIDAVFD GRYTPATGFI APTFQNVREG
VCTYCRKDVE KAQALLAEAG GWKGGKLVLW ANAGSSHEAW LQAAGDQIRA ALGIDYELRV
NLQFPEYLEA ADNRKFTGPF RLGWGPDYPS LETYLAPLYG TGADSNSSTF SNPEFDRLMK
QGDSASSIEE AVTFYQKGED ILAEELPVIP MFWRKVGALY SENVDNFVWN QFSGADYGAT
SLK