Gene Sare_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2999 
Symbol 
ID5707609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3406849 
End bp3408108 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content65% 
IMG OID641272446 
Productextracellular solute-binding protein 
Protein accessionYP_001537814 
Protein GI159038561 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.662901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000395876 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGATGC GTCATTCGGT TGCCGCAGTG GCAGCCGCAT CGGCGATGCT CCTAGCCGCG 
TGCGGGGGTA GCTCCGACGA CACCGCCGGC GGGTCGGTCG ACTCGCTGAA GTTCTACAAC
GACAAGGCGG CCTGGAAGCC GCAGTTCGAG GAGGTCAGCA AGGTCTCCCA GGACGAGATC
GGCCTCGCGC TCGAGCCGGT TGGCTACTCC GAAGCCAACC AGTACGCCGC GTTCATCCGG
TCCTCGTTCC GGACCAAGGA AAAGCCCGAC CTGTTCACCT GGCATACAGG CAAGGAACTT
GAGGACCTCG TCAGACAGGG ACTGGTCGCC GAGACCACGT CTCTGTGGGA CAAGGCCATC
GCCGATGGGG ACGTCCCCGA AGATCTCCGT GAGTACTTCA CGGTCGACGG CAAACAGTAC
TGCGTCCCCC TGCAGGCCGG CTACTGGGTG ATGTTCTACA ACAAGCGCAT TTTCGACCAG
GAGGGCATCA CGCCCCCGAG CACGTGGGCG CAGCTCGAGG CCGCTGCCGA GAGGCTCAAG
GGCGCGGGCG TCACGCCGTT TCACCAGACG AACGTCCTGT TCACGTTCTC GTGGTTCCAG
ACCCTCCTGA CAGGCACCGA CCCGGAGCTC TACGAGGCAC TGTCCACGGG TGAGGCGAAG
TACACCGACC CTGGCGTGGT GAGCGTCATG GACAAGTGGC GGGCCATGCT CGATAAGGGG
TACTTCAGCG ATCCGGGCTC CAAGACCGAC CCGCAGGTGA TGCTCAAGAA CGGCGACGTC
GCCATGATCA ACATGGGTAC CTGGTTCAAC GGCAACCTCA AGTCAGTCGG CATGGAGATC
GACAAGGACT ACGGGATGTT CGTCATCCCC AACGTCGACC CGTCGCTCGC CACCAGGCCG
ATGGTCGTCG AGGCCGGCCC GATGTGCACT GCTGCCGACG CGACGCACCG CGAGGAGGCC
GAGAGGTACT CGGCGTGGTG GTTCACCCCA CCGGCGCAGA CCGCCTGGGC GAACGCTCGC
GGTGAACTCT CGTTCAACCC GAGGGCCGAG GTCAGCGACG AGACCCTCGC CAGCCTCAGC
GACAAGATCA ACAAGGGTGA CTACAGGCTG ATGAACCGCT ACTTCGAGGC CGCACCCGTG
CCGGTGCTGA CCGCCGCGCT CGACGGATTC GGCGCCTTCG TCACCAAGCC CGGCGACCCG
ATGCCGGTGC TCAAGGAGGT GCAGGCGGCC GCCGACGCCT ACTGGGCCGA GCAGGGGTAG
 
Protein sequence
MKMRHSVAAV AAASAMLLAA CGGSSDDTAG GSVDSLKFYN DKAAWKPQFE EVSKVSQDEI 
GLALEPVGYS EANQYAAFIR SSFRTKEKPD LFTWHTGKEL EDLVRQGLVA ETTSLWDKAI
ADGDVPEDLR EYFTVDGKQY CVPLQAGYWV MFYNKRIFDQ EGITPPSTWA QLEAAAERLK
GAGVTPFHQT NVLFTFSWFQ TLLTGTDPEL YEALSTGEAK YTDPGVVSVM DKWRAMLDKG
YFSDPGSKTD PQVMLKNGDV AMINMGTWFN GNLKSVGMEI DKDYGMFVIP NVDPSLATRP
MVVEAGPMCT AADATHREEA ERYSAWWFTP PAQTAWANAR GELSFNPRAE VSDETLASLS
DKINKGDYRL MNRYFEAAPV PVLTAALDGF GAFVTKPGDP MPVLKEVQAA ADAYWAEQG