Gene Sare_2193 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2193 
Symbol 
ID5708188 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp2525189 
End bp2526712 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content73% 
IMG OID641271674 
Productextracellular solute-binding protein 
Protein accessionYP_001537045 
Protein GI159037792 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0522887 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.720743 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCC TCGACGAGCG GGAGCGCGGC AACAGCGGAC CTGCGGTGCC GCTGCGGTTC 
CTGGGGCCCG TCGACGGCGG ACTGCCCGAC GAACCGCCGT CACATGTGGT CGGCACCGGA
CAGATCCTCC GGCTCCTCAC CCGACAGCTG TTCGCGTACC CCGCAACCAC GGATCCGTCC
GACCGCCCGG TACCCGACGC CGCAGTAGCC CTACCGACCG TTGGCAACGG TGGAGTCAGC
GCCGATGGAC GCACCTGGCG CGTGCGGGTG CGCCGCGGCG TCCGGTGGGA CGCCGCAGCG
GCGCGGGAGT TGACCGCCGG GGACGTGGTG CGGGGGCTCA AACGCACCGC ACACCCGCTG
GCCCGGTCGA TCCGGCCCTA CTTCGGAGCC ATGATCGAGG GCATGGCCGA CTACTACCGG
GCCTATGACG AGGCGTTCGG GCACTGGGCG GGGCATGCTC CGGCGTTCGC CCAGTTCCAG
CAGCGCACCC ACATCCCCGG CCTCTGGGCG GAGGGAGACA GCGTGTGGCT GCGGCTCGTC
GAGCCCGCCA GCGACCTCGT CGACCTGCTC GCCACCGGGT TCGCCGCGGC GGCCCCGCGA
GAGTTCGACT ACCACGTGCC GGACAGCTCG CAGCTGTACC GGTGTACGCC CTCATCCGGG
CCGTACCGAA TCGCGACACG GCTGACGTCA GGTCGGGGCC TCGTCCTGGA GCCGAACCCC
CGGTGGGATC CGGCTACCGA CCCGGTACGT CGCCTGGTCT CTGGGCGGAT CGAGGTCGTT
GCGACGGCTG ACGGCAAGCC GCTGTCCCCG GCCACCGGAT TCGGGGTGCT CTCCTGGTCG
GTACCGTCGC CGCCGGCCCG GTTCGTCGGT AGGGGGTGCA GCTATTACCT CGCTGCCGGA
CCCGGCCGGG CCGGCCGGGC TCGGGCGCTA CGACACGCGT TGTCCTGGGC CATCGACCGG
GCGGCCCTGG CCGACGCCGT AGCCGGGGCG GGAGCCGAGG GCGTCGCCGT GCAGCATGGT
GTCCTGCAAC CCGGTCAGCG GGGCGCGGCG CACGTTGCCG GTGCCGCTCC CGATGTCGGT
GACCCGGACC GAGCCCGGCA CCTGCTCGCC GACGTGGGAC TCGGGGTGGG GGCCCGCCTG
TCCCTGGGGG TGCCGGCGGG ATCCCGGACG GCGGCGGTGG CCACCGGGCT GGCCGCGGTG
CTCGCGAGGT ACGGCATCGA CGTCGTGCGG GTACCGACGA CTGCGGTCGG TGTCGATCTG
GTGCTGCACG AATGGGCCCC GGCATGGGCC GGCAACGCCA ACCGTGACCT GCTGCACCGG
GTGTGGCCGG CCGACCGGGT GGCCACGGAA CCGGCAAGGT CCGCCGCGGA GGCGCTGCGC
GCGCTGGAAC CGGAGCGGGC GGCGACGTGG TGGCGTCGTT TCGATCGACT GGCCACCGCC
GATCTACGGC TGGTGCCACT GATCGTGGCT GCCTGGCCAC GGGCCCGGGC CGATGCCACG
GCCACCGGCG TGGCGCGATG GTGA
 
Protein sequence
MTTLDERERG NSGPAVPLRF LGPVDGGLPD EPPSHVVGTG QILRLLTRQL FAYPATTDPS 
DRPVPDAAVA LPTVGNGGVS ADGRTWRVRV RRGVRWDAAA ARELTAGDVV RGLKRTAHPL
ARSIRPYFGA MIEGMADYYR AYDEAFGHWA GHAPAFAQFQ QRTHIPGLWA EGDSVWLRLV
EPASDLVDLL ATGFAAAAPR EFDYHVPDSS QLYRCTPSSG PYRIATRLTS GRGLVLEPNP
RWDPATDPVR RLVSGRIEVV ATADGKPLSP ATGFGVLSWS VPSPPARFVG RGCSYYLAAG
PGRAGRARAL RHALSWAIDR AALADAVAGA GAEGVAVQHG VLQPGQRGAA HVAGAAPDVG
DPDRARHLLA DVGLGVGARL SLGVPAGSRT AAVATGLAAV LARYGIDVVR VPTTAVGVDL
VLHEWAPAWA GNANRDLLHR VWPADRVATE PARSAAEALR ALEPERAATW WRRFDRLATA
DLRLVPLIVA AWPRARADAT ATGVARW