Gene Snas_0173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_0173 
Symbol 
ID8881350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp174918 
End bp176249 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content68% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003508986 
Protein GI291297708 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAACGT CGCCACATCG CCCACCCGCG GGCCTGAGTC GCCGGTCGCT GCTGGGTGGT 
GCGGCGGCGC TGGCGGCGGT TCCGCTGTTG TCGTCCTGCG TGGGTTTCAA CACCAGCGGC
GGCAAGGCCG GCAGCCTCGA CTTCCTGTCC ACCCAGTTCA CGCCGGTGGA GGAGAAGCAG
CGGTTCGAGA AGGTCCTGGC CGACGCGAAG GTCAACGCGG CGTACAACGC GGTGGAGGGG
AACGTGTTCG CGTCCACGCT GACCTCGCAG GCCGAGGCCG GGAGCGTGCA GGTGAGCCTG
GCCGGGGCCA TGCACGGCGA ACTGGCGCCG TTGGCCGACC GGTTCACCGA CGTAGACGGG
CTGTTGAAGG GGAAGCTGGC GCAGGCCGAG TATCCGAAGG ACCTGCTGGA GTTGGCCAAG
GCCGGGGGTT CGACCGCGAA GTACATCCCG TGGATGCAGG CGTCCTATGT GGTCGCCGTC
CACAAGCGGG CGCTAGAGTG GCTGCCCTCG GGGGCCGACG TCAACTCGCT GACCTACGAC
CAGTACCTGG ACTGGGCGAT CGCGGCGCGA AAGGCCAACG GCAGCCCGGT CTTCGGGTTC
CCCGCCGGGC CGGACGGGCT GTACGCCCGC TTCGTCCAGG GGCATCTGCT GCCGAGCTTC
ACCGGTGGGC AGGTCACGAC GTTCCGCAGC GCGGACGCCA TCGACGCGTG GAAGTACATG
AAGGAGCTGT GGGCGAACTT CGTCCCCGCC TCCACCAACT ACGACAACAT GCAGGAGCCG
TTGGCCAAGG GCGAGGTCAT GGTCGCCTGG GACCACATCG CCCGCATCAT CGAGGCGCCC
AAGGGCAATC CGGACGAGTG GCTGCTGGTG CCGTCTCCGA AGGGCCCCAA GGGTTTGGGG
TACATGCTGG TGGTCGCGGG GTTGGCGATC CCCGACGGCG CCCCCGATCC TGACGGCGCC
ACCGACGCGA TCCTGTCACT GTCCGAACCG GACGTACAGA TCGAGGTGCT GAAGCAGAAC
ACCTTCTTCC CGGTGTCCGT CACGGAACTG CCCGACGACC TGGAGGGCGC GACGAAGCTG
GCCGCCGAGG CGATCACCGC GCAGCGGGAG GCCAAGGACG CGATCATGGC GCTGCCGCCG
GTGGGAACCG GGGAACGCGA CGGCGAGGTC ACCGCGGTGT TCCAGAACTC GTTCCGGCAG
ATCTGCCTGG ACGACCGATC GATCAAGTCC GTCGTGGACG AACAGGCGGC CGAGTTGCAG
TCCATTCTCG ATGACCTCAA GATCCCCTGC TGGGCACCCG ATCCGGCCGA AGCCGTCTGC
GAGGTGGGCT GA
 
Protein sequence
MATSPHRPPA GLSRRSLLGG AAALAAVPLL SSCVGFNTSG GKAGSLDFLS TQFTPVEEKQ 
RFEKVLADAK VNAAYNAVEG NVFASTLTSQ AEAGSVQVSL AGAMHGELAP LADRFTDVDG
LLKGKLAQAE YPKDLLELAK AGGSTAKYIP WMQASYVVAV HKRALEWLPS GADVNSLTYD
QYLDWAIAAR KANGSPVFGF PAGPDGLYAR FVQGHLLPSF TGGQVTTFRS ADAIDAWKYM
KELWANFVPA STNYDNMQEP LAKGEVMVAW DHIARIIEAP KGNPDEWLLV PSPKGPKGLG
YMLVVAGLAI PDGAPDPDGA TDAILSLSEP DVQIEVLKQN TFFPVSVTEL PDDLEGATKL
AAEAITAQRE AKDAIMALPP VGTGERDGEV TAVFQNSFRQ ICLDDRSIKS VVDEQAAELQ
SILDDLKIPC WAPDPAEAVC EVG