Gene Snas_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1041 
Symbol 
ID8882226 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1101290 
End bp1102540 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content68% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003509844 
Protein GI291298566 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.172945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.776305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTGAATC CGACGCGTCG ATTGTGGTGT GCCGCGCTGG CCTGCGTCAC CGCGGCCGGG 
CTGCTGGGGG GCTGCGGCAG CGAGGCCAGC GCGCGGGTCG TCAACCTCTA CAATGCCCCG
CAACAGAACC TGTCCAAGAT CGTCGAACGC TGCAACCGGC TCGCCGACGG TGACTACAAG
ATCGTCCTCA ACACCCTGCC GCGCGACGCC GACGGGCAGC GCGAGCAGAT GGTGCGGCGG
CTGGCCGCCG AGGACACCGG CATGGACGTG CTGGGCATCG ACATCACCTG GACCGCCGAG
ATGGCCAGCG CCAAGTGGAT CCTGCCGTGG AAGGGCGAGC ACGCGGCCCA GGCCAAGGCC
GGGGTCGCCA AGGCGCCGTT GAAGACCGCC ATGTTCGAGG ACCGGATGTA CGCCGCGCCG
TCCAACACGA ACGTGCAGCT GCTGTGGTAC CGCTCCGACC TGATGCCCGA ACCGGCGCAG
ACCTGGGAGC AGCTGATCGG CGTCGCCAAG AAGCTGAAGC AGGAGGACAA GGCGCACTAC
CTGGAGGTCA CCGGCGCCCA GTACGAGGGC CTGGTGGTGT GGTTCAACTC GATGGTGGCC
GCCGCGGGCG GCAGCATCCT CAACGCCGAC GGCGACAAGG TGGAACTGGG CGAACCGGCC
GTGAAGGCGC TCAAGACGAT GCGGACCTTC GCCCGGTCGG ACGCCGCCGA CCCGTCGCTG
TCCAACACCC AGGAGGACAC CGCCCGGCTG GCGGTGGAGA GCGGCTCGGG ATTCGCGGAA
CTGAACTGGC CGTTCGTGTA CGCGGCCATG CAGGCCAGCG GCAAGGACTT CGCGAAGGAC
TTCAGGTGGG CGCCGTATCC GGGCATCGAC GGGCCCGGCA ACGCGCCGCT GGGCGGCTCC
AACTTCGCCA TCAGCAAGTA CTCCACCAAC CGCTCCGAGG CCTTCGACGC GGCGCTGTGC
CTGCGCGACA AGGAATCCCA GCGCATGTCG GCGGTACTGG ACGGACTGCC GCCCACGATC
GAGTCGGTGT ACGACGCCAA GGGCATGGCC AAGGCTTACC CGATGCGGGG CGAGATCCTC
AAGGCGCTGG AAACCGCGGT GCCGCGTCCG GTCACCCCGG TGTACCAGAA CGTGTCCACG
GTGACGTCGA AGTACCTGTC GCCGCCGTCC TCGATCCAAC CGGTGGAGAC CGAACGCAAA
CTGCGCGAAC AGCTCGTCAA GGCCCTCAAC TCCGAAGGAG TGCTGCCGTG A
 
Protein sequence
MLNPTRRLWC AALACVTAAG LLGGCGSEAS ARVVNLYNAP QQNLSKIVER CNRLADGDYK 
IVLNTLPRDA DGQREQMVRR LAAEDTGMDV LGIDITWTAE MASAKWILPW KGEHAAQAKA
GVAKAPLKTA MFEDRMYAAP SNTNVQLLWY RSDLMPEPAQ TWEQLIGVAK KLKQEDKAHY
LEVTGAQYEG LVVWFNSMVA AAGGSILNAD GDKVELGEPA VKALKTMRTF ARSDAADPSL
SNTQEDTARL AVESGSGFAE LNWPFVYAAM QASGKDFAKD FRWAPYPGID GPGNAPLGGS
NFAISKYSTN RSEAFDAALC LRDKESQRMS AVLDGLPPTI ESVYDAKGMA KAYPMRGEIL
KALETAVPRP VTPVYQNVST VTSKYLSPPS SIQPVETERK LREQLVKALN SEGVLP