Gene Sare_2073 details

Gene Information

Locus tag: Sare_2073
Symbol:
ID: 5703284
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2382400
End bp: 2383953
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 70%
IMG OID: 641271559
Product: extracellular solute-binding protein
Protein accession: YP_001536930
Protein GI: 159037677
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
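
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS includes its stop codon; neither is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2382400, 2383953)
print(length)                     # 1554
print(protein_length_aa(length))  # 517
```

Both values agree with the Gene Length and Protein Length fields of the record.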


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.610417
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.462432
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTACCC CTGTCAGATC CCGCCTGCTC CGGCGCGGGC TGCTGCCCCT CACCATCGCC 
GCCCTGCTGC TGGCCGGCTG CGGCACCGAC ACCACGACCG GTAGTTCCGC CGACGAGCCC
GGCACGCCGG TCGACGGCGG CACCCTGCGC TACGTCGTAC CGGGATCGCC GGCGACCGCG
AGCAACGACC CACATGGCGG GCTCGGCAAC GAGTCCGACC TCATGCGCTT CGCGCTGACC
TACGACGTGC TCACCGTGCC CGGCGCCGAC GGCACCCCGC AGCCGCGCCT GGCCCAAACG
TGGAAGGCGA ACCAGAGCCT GGACCGCTGG ACGTTTCACC TGCGCGAGGA CGCCACCTTC
ACCGACGGCC AGCCGGTACG CGCCAAGGAC GTGCTCTACT CGCTGACCCG GATAGCGGAC
AAGGCCGCCG AGAACTACGG CCGACTGGCC GACTTCGACA TGGCCGCCGC CAGCGCACCA
GACGACCACA CGGTGGTGTT GGCGACCCGG GCGCCGATGG CCGAAGCACC GAAGGCGCTG
GAATCGATCA GCTTCGTCGT TCCCGAGGGC AGCACGGACT TCGCCGAGCC GGTCCGCGGC
TCAGGACCGT TCCGGGTGAC CGAGACCGAC GCCCAGACCG CCGTACTCCT GCGAAACGAC
GACTGGTGGG GCGAACGACC GCACCTGGAC CGGATCGAGA TCCGGGCCGT CGCCGACCCG
CAGGCTCGCG CCGCCGCCGT GACCTCCGGC CAGGCGGACG TCGCCGGAAG CGTCAGCCCG
GCGGCGGTCA AAGCCGCCGA GGCCGGCGGT GACGTGCAGG TGGTCCGCCG CAAGGGCGTG
ACCGAGTACC CGATCATCAT GCGCCTGGAC TCCGCACCGT TCGACGATCC ACGAGTGCGG
GAGGCGTTCC GTCTCGCGAC CGACCGGCAG GCCCTCGTCG ACACGGTGTT CCTCGGATAC
GGCCAGATCG CCAACGATCT GCCCACCCCG TACGACCCGT CGTACCCGCA GGATCTGACG
CAGCGCACCC GGGACCTGGA CCGGGCCAGG GAACTACTCG AGCAGGCCGG ACACGCGAAC
GGGCTGACGC TGACCCTGCA CACCACGACG TCGTACCCCG GCATGGACAC CGCGGCCACC
CTGTGGGCCA GGCAACTCGC CGACGTCGGC GTACAAGTCG ACGTGAAGGT GGAGCCAGCC
GACACCTACT GGACCGCCAT CTACGCCAAG AAGGACTTCT ACGTCGGCTA CTACGGCGGC
ATCTCCTTCC CCGACCTGGT ACGCGTCGGT CTGCTGGCCG CCTCGCCGAC CAACGAGACC
GCCTGGCGCA ACGCGTCGTT CGACGCCGAG TTCAACGCCG CCATGGGCAT CCTGGACCCG
GCCGAGCGCA ACACCCGACT GGCCCGTATC CAGCAGGAGC TGTGGCGCGA CGGCGGGTAC
GTGGTGTGGG GCGTCGGTGA TGGGTTGGAC CTGACCGTCC CCGGTGTGCA CGCTCTGCCC
GACGGTCCCG GCTTCCAGCG GATGTTCATC GAACGCGCCT GGAAGACGAG GTGA
 
Protein sequence
MPTPVRSRLL RRGLLPLTIA ALLLAGCGTD TTTGSSADEP GTPVDGGTLR YVVPGSPATA 
SNDPHGGLGN ESDLMRFALT YDVLTVPGAD GTPQPRLAQT WKANQSLDRW TFHLREDATF
TDGQPVRAKD VLYSLTRIAD KAAENYGRLA DFDMAAASAP DDHTVVLATR APMAEAPKAL
ESISFVVPEG STDFAEPVRG SGPFRVTETD AQTAVLLRND DWWGERPHLD RIEIRAVADP
QARAAAVTSG QADVAGSVSP AAVKAAEAGG DVQVVRRKGV TEYPIIMRLD SAPFDDPRVR
EAFRLATDRQ ALVDTVFLGY GQIANDLPTP YDPSYPQDLT QRTRDLDRAR ELLEQAGHAN
GLTLTLHTTT SYPGMDTAAT LWARQLADVG VQVDVKVEPA DTYWTAIYAK KDFYVGYYGG
ISFPDLVRVG LLAASPTNET AWRNASFDAE FNAAMGILDP AERNTRLARI QQELWRDGGY
VVWGVGDGLD LTVPGVHALP DGPGFQRMFI ERAWKTR
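
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline. Translation table 11 assigns amino acids to codons in the same way as the standard code (the two differ in permitted start codons), so a standard codon table is used here. The function name and the whitespace handling are assumptions made for illustration.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are shared by the
# standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, and stop at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above
print(translate("ATGCCTACCC CTGTCAGATC CCGCCTGCTC"))  # MPTPVRSRLL
```

Passing the whole gene sequence block to `translate` in the same way, spaces and line breaks included, yields a string that can be compared with the protein sequence after its spaces are removed.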