Gene Sare_3207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_3207 
Symbol 
ID5705538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3697273 
End bp3698676 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content65% 
IMG OID641272638 
Productextracellular solute-binding protein 
Protein accessionYP_001538005 
Protein GI159038752 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTA CCCCCGATCT CAACCGTCGG ACCCTGTTGC GTCGCGCCGC GGCCGCGGGT 
CTGCTGACCC TCCCGGCCGC CGGCGTGCTC AGTGCCTGCG CCGGCAGTGA GCCGGCCCAG
GACGACAGCT CCGGTGCCGC GAAGAGCAAG GACAACCCGT TCGGCGTCAA GGACGACAGC
TCTGTCAAGG TGGTCATCTT CAACGGCGGG CTGGGCGACC AGTGGGCCAA GGAGGACGAG
GCCGTCTTCA AGGCCAAGTA CCCGAACATC ACGGTCAACA TGTCGTCGAC CCAGAAGATC
AAGACCGAAG AACAGCCGAA GATGGCGACC CGGCCCAGCG ACGTCGTCAT GAACTCCGGC
GCCGACATCA TGGACATCAG CACCCTGATC AACGAGAGCG CGATCGAGCC GCTGGATGAC
CTGCTCGACG CCCCGGCCTG GGACAGCGAG GGCACGGTGG CGGACACCCT GCTGCCGGGG
ACCGTCAGCG ACGGCACCTT CCAGGGCAAG TTCTACGTGG TGAACATCGC GTACACGGTG
TGGGGTAACT GGTACAACGC CGCCCTGTTC GACAAGGAGG GCTGGCAGCC GCCGAAGACC
TTCGACGAGT TCTTCGCCCT CGCGCCGAAG ATCAAGGCGA AGGGCATGGC CCCGTACGTC
TACGACGCGG TGCACGGCTA CTACCCGCGC TGGGCGCTGA TGGCGACGAT CTGGAAGTCC
GCCGGTAAGC AGGCCGTGAT CGACATCGAC AATCTCAAGG AGAACGCCTG GAAGGCCGAT
GGGGTGCTGC CGGCCCTTCA GGCGTGGGAG AAGCTGGTCA AGGACAAGCT GCTGCTCCCC
GGCAAGCTCG ACCACACCCA GTCGCAGCAG GCGTGGCTCG ATGGCAAGGC CGCGTTCATC
CAGCTCGGTA CCTGGCTCAA GAACGAGATG GCGGAGACCA TCCCGCCGGG CTTCGAGATG
AAGCTGTCGG ACTACTGGAG CCTGGGGGCG AGCGACAAGG CGCCGAACGA CGTCTACGCC
GGTGCGGGTG AGGGCATCGT CGTGCCGTCG AAGGCGCCGA ACAAGGCCGC TGCCAAGGAG
TTCCTGCGGG CAATGCTCTC CAAGGAGGGC TCGGCGAAGT TCGCCGAGCT GACCAAGTCC
CTCGCCTCCA CCAAGGGCTC CGGGGACAAC GTCCAGGATT CGGCGCTGGC CAGCGCGAAC
GAGCTGATGA GCAACGCCCC CCAGGATCTG GTCTCGTTCA AGTTCTGGAA CTTCTACGCC
GACCTGGACA AGGCGAGCCA GAACTTCTGT GCGGAGTTGA TGGCCGGCCG GCTGACCGCT
CAGGAGTTCG TCGACGGCAT GCAGGCGGCC GCCGACAAGG TCGCCAAGGA CTCGTCCGTC
AAGAAGCAGA CCCGCTCCGC CTGA
 
Protein sequence
MSATPDLNRR TLLRRAAAAG LLTLPAAGVL SACAGSEPAQ DDSSGAAKSK DNPFGVKDDS 
SVKVVIFNGG LGDQWAKEDE AVFKAKYPNI TVNMSSTQKI KTEEQPKMAT RPSDVVMNSG
ADIMDISTLI NESAIEPLDD LLDAPAWDSE GTVADTLLPG TVSDGTFQGK FYVVNIAYTV
WGNWYNAALF DKEGWQPPKT FDEFFALAPK IKAKGMAPYV YDAVHGYYPR WALMATIWKS
AGKQAVIDID NLKENAWKAD GVLPALQAWE KLVKDKLLLP GKLDHTQSQQ AWLDGKAAFI
QLGTWLKNEM AETIPPGFEM KLSDYWSLGA SDKAPNDVYA GAGEGIVVPS KAPNKAAAKE
FLRAMLSKEG SAKFAELTKS LASTKGSGDN VQDSALASAN ELMSNAPQDL VSFKFWNFYA
DLDKASQNFC AELMAGRLTA QEFVDGMQAA ADKVAKDSSV KKQTRSA