Gene Sare_2038 details

Gene Information

Locus tag: Sare_2038
Symbol:
ID: 5705692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2333213
End bp: 2334886
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 74%
IMG OID: 641271528
Product: extracellular solute-binding protein
Protein accession: YP_001536899
Protein GI: 159037646
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
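
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (an assumption, although it reproduces the listed 1674 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2333213, 2334886)
print(length_bp)                  # 1674
print(protein_length(length_bp))  # 557
```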


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000868803
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACCCAGG CACGCAGCAC AGCGATCAGC GCACCCGGCA CCCGGCAGCG GAACATCCTG 
CGGTTGCTCG GTACCGCCGC CGTCCACCAG GCCGACCCGG CCGCCGCCTG GTCCCCGGCC
GAACGCCAAC TGCTGCGCCT GACCACGCGG CAACTGGTCA GCTACCCAGG GGCGGCTGAC
CCGACCGACT GGCGGGCCCT CGGTCCGGTC GGTGACCTGG CCGTCGACGT GCCGTCCACC
TACAACGCCG GGCTCGGCGC CAGCCACCGC TCGTACGTGG CACACCTGCG TCCGGACGTG
TGGTGGGACA GCCCGCAGCC GAGGCGGATC ACCGCACACG ACGTCGTACG CGGATTCAAG
CGACTCGCCA ACCCCGTGAC CCGTCATCCC GCCCTGCCCT ACTTCCGCAG CACCATACGG
GGCATGGACC GGTACTGCGA CGAGTACGCC GCCGTCGTCG CCGGCCGGCC GGTCACCGCG
GCACTGCTGG CCGCCTTCGC CAACGCGCAC GACCTGCCCG GCGTGTTCGC CCTCGACGAC
GAGACGGTCG TCATCGAACT CCTCCGCCCG GCGTTGGACT TTCCCAACAT GCTCGCCCTG
AGCTGCGCCT CCCCGGCCCC CGCCGAGTAC GACGCGTACC TGCCGGGCAG CACCGAACTG
CACGCGCACC TCGTCGCCAG CGGTCCGTAC CGGGTCGCCA CGTGGCAGCC CGGGGACACC
ATCCGGTTGG AACCCAACCC CACCTGGCGC TCGGAGAGCG ACCCGGTGCG GCACCAGCGT
TTCGACGCCG TGGAGTTCCG GGTGTCCGGC GACGGTCCGC GCCGGCTGGC CGACCAGATC
TCCGCCGACG TGGCCGACCT GCCGTGGGGG GTTCCCGTCG GCGAGGTGAG CGGGTACCGG
GCCGACCCGT TCCTGGTGTT CAACCTGCGC GACCCAGCCA ACCCGGCGAT GACCACGGCA
GCCGTGCGAC AGGTGATCGA CGAGGCGATC GACCGGTCCG CGTTGGCCCG GATCGCCCGC
GTCGGTGACC CGTGGTCGGC GGTTCGCGAG GCGTACACCG TGGTGCCGCC GGGCAACGAC
GGACACCTGC CTTCGGACCC AGCGGCCGAC CCACCGGCGC ACGGCGCCCC CCGCGAGCGG
CTCACCGCCG CCGGCCATCC GAACGGGCTC CTCCTGACCG CGGTGTGCCC CGACCGGACC
GAGGAACTGG CCCTGGCCCG CGCCTGGGCC GCCGACCTGG CTACGGCCGG CATCGAGGTA
CGGCTGGTGG CGTTGGACGA GGCGACGCAC CGGGCGCTGC TCACCGGCGC CGCAGGCGCA
CCGGCCCAAC GCTGGGACGT CAGCACCACG TCGTGGACGG CGCCATGGGG GTACGGCAAC
GCGCGGGTGT TCCTCCAGCC GCTGGTGGAT GGCGCACGGC CGAGCGGCCA CCGCGACGAG
GAGATCGACC GGATGGTCGA GCAGGCGGTC GACGCCGCCG ATCCCCGGGA GGCCGTGGCG
TGCTGGCAGC AGGTGCAGCG ACGGCTGCTG GCCGACGCGG CGGTCGTACC CCTGCTGTTC
CGACGCCCCA CCGACGCGGC ACCGCGCGGG CCGCGAGTGC GCCGCGCCGA CGCGCTGCCC
TCGCTCGGTG GCCTGGCCGA CCTCGGCGAC GTGCGCCTGG GGAACGAGCG GTGA
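
The GC content reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as shown here, in space-separated blocks of ten bases; `gc_content` is a hypothetical helper name. Only the first line of the gene is used in the usage example, so the printed value will not necessarily equal the whole-gene figure.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only; paste all lines to check the whole gene.
first_line = "GTGACCCAGG CACGCAGCAC AGCGATCAGC GCACCCGGCA CCCGGCAGCG GAACATCCTG"
print(f"{gc_content(first_line):.0%}")
```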
 
Protein sequence
MTQARSTAIS APGTRQRNIL RLLGTAAVHQ ADPAAAWSPA ERQLLRLTTR QLVSYPGAAD 
PTDWRALGPV GDLAVDVPST YNAGLGASHR SYVAHLRPDV WWDSPQPRRI TAHDVVRGFK
RLANPVTRHP ALPYFRSTIR GMDRYCDEYA AVVAGRPVTA ALLAAFANAH DLPGVFALDD
ETVVIELLRP ALDFPNMLAL SCASPAPAEY DAYLPGSTEL HAHLVASGPY RVATWQPGDT
IRLEPNPTWR SESDPVRHQR FDAVEFRVSG DGPRRLADQI SADVADLPWG VPVGEVSGYR
ADPFLVFNLR DPANPAMTTA AVRQVIDEAI DRSALARIAR VGDPWSAVRE AYTVVPPGND
GHLPSDPAAD PPAHGAPRER LTAAGHPNGL LLTAVCPDRT EELALARAWA ADLATAGIEV
RLVALDEATH RALLTGAAGA PAQRWDVSTT SWTAPWGYGN ARVFLQPLVD GARPSGHRDE
EIDRMVEQAV DAADPREAVA CWQQVQRRLL ADAAVVPLLF RRPTDAAPRG PRVRRADALP
SLGGLADLGD VRLGNER
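
The gene begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can serve as an alternative start codon. The sketch below illustrates this under stated assumptions: the codon assignments are the standard ones in NCBI TCAG order, the set of start codons is taken from the NCBI listing for table 11, and any listed start codon in the first position is translated as M. The names `translate`, `CODON_TABLE` and `START_CODONS` are illustrative, not from any library.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in NCBI TCAG order (table 11 shares them).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons assumed from the NCBI listing for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(cds: str) -> str:
    """Translate a CDS; the first codon becomes M if it is a listed start."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGACCCAGG CACGCAGCAC AGCGATCAGC GCACCCGGCA CCCGGCAGCG GAACATCCTG"
print(translate(first_line))  # MTQARSTAISAPGTRQRNIL
```

The output matches the first twenty residues of the protein sequence listed in this record.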