Gene Sare_0255 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0255 
Symbol 
ID5705405 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp281946 
End bp283610 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content65% 
IMG OID641269783 
Productextracellular solute-binding protein 
Protein accessionYP_001535178 
Protein GI159035925 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.192685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00254463 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTGCAT CCAGGCCGAA GGTCGCTGTC GCGGCTGTCG CGGTCGCGGC CCTCGCGGTA 
TCCGGCTGCG CCGAGAGCGA CCGCGACGAT TCTTCCGGTG ATAGCAACAA CGACACCCTG
GTCTTCGGCG TCGCCGGAGA TCCGAAGGTG CTCGACCCGA GCTTCGCCAG CGACGGCGAA
TCGCTGCGTG TGTCCCGTCA GATCTTCGAG ACTCTGGTCC GTCCCGAGGA GGGCGGCACC
AAGGTGAGCC CCGGCCTGGC CGAGTCCTGG ACCCCGGACG CAGCCGGCAC GACCTGGACC
TTCAAGCTTC GCTCGGGCGT GAAGTTCCAC GATGGTACCG ACTTCGACGC CGAGGCCGTC
TGCGTCAACT TCAATCGCTG GTACAACGCC AAGGGCCTCA TGCAGAGCCC GGACGTGACC
ACGTACTGGC AGGACGTGAT GAACGGCTTC GCGCAGAACG AAAACGACAC GCTCTCGGAG
AGCCTGTTCA AGTCCTGCAC CGCCACGGAC GCCACCACGG TCGACCTGGC CTTCACCCGG
GTGTCCAGCA AGATCCCGGC CGCCCTGATG CTGCCGTCGT TCTCCATCCA CAGCCCGAAG
GCGCTGGAGC AGTACGACGC GAGCAACGTC GGCGGCACGG CGACGGACGT CAAGTACCCC
GAGTACGCGA CCGGGCACCC GACCGGTACC GGACCGTTCA AGTTCAAGGC CTGGGACATC
GCCAACAAGA CGCTCACTAT CGAGCGTAAC GACGACTACT GGGGCGAGAA GGCCAAGCTG
AAGACCCTTA TCTTCAAGAC CATCTCCGAT GAGAACGCCC GCAAGCAGGC GCTGCGGTCT
GGTGACATCC AGGGCTACGA CCTGGTCGGG CCGGCTGACG TCGAGCCGCT GAAGGCGGAG
GGCTTCAACG TCCTGACCCG GCCGGCGTTC AACATCCTCT ACCTGGGGAT GAACCAGAAG
GGGAACCCGA AGCTGGCCGA CCTCAAGGTG CGGCAGGCGA TCGCCCACGC GATCAACCGG
CAGGCCTTGG TCGACTCGAA GCTCCCCCCG GGAGCGAAGG TCGCGATGAA CTTCTTCCCG
GACACCGTCG AGGGTTGGAA CGGTGACGTC ACCACGTACG ACTACGACGT CGACAAGGCC
AAGCGGTTGC TGGCCGAGGC CGACGCGGCC GACCTGACGC TGCGGTTCCA CTACCCGACC
GAGGTCACCC GCCCGTACAT GCCGAACCCG AAGGACCTCT TCGAACTGGT GTCGGCGGAC
CTGCAGGCGG TCGGCATCAC GGTCGAGCCG ATCCCGCTGA AGTGGAGCCC GGACTACCTG
AACGCCACCA CGTCCGGCAG CGAACACGAC CTGCACCTGC TCGGATGGAC CGGCGACTAC
GGCGACGGCT ACAACTTCAT CGGCATCATG TTCGACCGGC AGAAGGACGA GTGGGGTTTC
GACAACCCTG CCCTCTTCGC TCAGTTCACG GATGCTGACA CCACCGCCGA CCGGGCGAGC
CGGGTGGAGA AGTACAAGGG CCTGAACAAG ACCATCATGG ACTTCCTGCC AGGCGTGCCG
ATCTCGCACT CGCCGCCGGC GATCGTCTTC GGCAAGGACG TGATCGGTGT CAAGGCCAGC
CCGCTCACCG ACGAGCGGTA CGCCAACGCC GAGTTCAAGT CCTGA
 
Protein sequence
MRASRPKVAV AAVAVAALAV SGCAESDRDD SSGDSNNDTL VFGVAGDPKV LDPSFASDGE 
SLRVSRQIFE TLVRPEEGGT KVSPGLAESW TPDAAGTTWT FKLRSGVKFH DGTDFDAEAV
CVNFNRWYNA KGLMQSPDVT TYWQDVMNGF AQNENDTLSE SLFKSCTATD ATTVDLAFTR
VSSKIPAALM LPSFSIHSPK ALEQYDASNV GGTATDVKYP EYATGHPTGT GPFKFKAWDI
ANKTLTIERN DDYWGEKAKL KTLIFKTISD ENARKQALRS GDIQGYDLVG PADVEPLKAE
GFNVLTRPAF NILYLGMNQK GNPKLADLKV RQAIAHAINR QALVDSKLPP GAKVAMNFFP
DTVEGWNGDV TTYDYDVDKA KRLLAEADAA DLTLRFHYPT EVTRPYMPNP KDLFELVSAD
LQAVGITVEP IPLKWSPDYL NATTSGSEHD LHLLGWTGDY GDGYNFIGIM FDRQKDEWGF
DNPALFAQFT DADTTADRAS RVEKYKGLNK TIMDFLPGVP ISHSPPAIVF GKDVIGVKAS
PLTDERYANA EFKS