Gene Snas_3696 details


Gene Information

Locus tag: Snas_3696
Symbol:
ID: 8884895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 3932595
End bp: 3934235
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 68%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003512446
Protein GI: 291301168
COG category:
COG ID:
TIGRFAM ID:
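
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3932595, 3934235)
print(length, "bp")                      # gene length
print(protein_length_aa(length), "aa")   # protein length
```

Under these assumptions, the computed values agree with the reported Gene Length (1641 bp) and Protein Length (546 aa).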


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.772784
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.201816
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCTGA CCAGCCTCCC CGACTGGTTG TCGTCGATCT CGTCGGCGGC CGCCCTGCTG 
TTCGCCGCGA TCGCCGCCGT GGCCGCCAGA AACGTGTACA AGATCGAGTC CGCCCGCGAC
CAGGCCAACG CCGCACTGCG CGCGAAACAG GACGCGCTGG AGCGGCGCGA CCAGGCGGCA
CTGGTGTCGG CGTGGTGGGG CTACTCGCCC GACGGCGGCC AACAGTCACA ACCGGGCTGG
GGCGTGTTCG TCCGCAACGC CTCCGAGACA CCGGTGTACA ACGCCGGATT CTCGGTGCTC
GACGTCCGCG ATCCCAACGT CAGCGAACGG TTCGACATGG TGGTGATACC CCCGGCGGCG
GAACCGGTGT TCCACCCGAG TCCGTTGAGC GCCTCCAAAC CCGCCGACTT CCGTGTCGAG
GTGACCTTCA CCGACTCGCG GGGACAGCGA TGGATCCGCG ACAAACAGGG CCGCCTCCAC
GAACTCGGAC CGGCGGTCGT GGTGTGGGGG GACGAGTCGC GCATCAACGC CCTGCGACGG
TTCTTCTCGG ACTTCCTCGC CTCCCACGGC GTCGAGATCC ACGCCCGCAC CGGCGACATC
GAAGACCTCC GCCGCGCACT GCTCGACGCC GACGACGCCA CCGCGGCACC CGACATCGTC
GTCGGACCGC ACGACTGGAT CGGCGGCCTG GTCGAACAGC AACTCATCGA ACCGCTGACG
CTGTCGCCAC AACACCGCGC GGCCTTCGAC GCGCTGTCCC TGGAAGCGAT GACCTACCAC
GGCGAGCTGT ACGGCATCCC CTACGCGTTC GACGCCCCGG TACTGCTGCG CAACACCGAT
CTCGTAGCCG AAGCCCCATC GTCCTTCGAG GACATGCTCC ACAAGGGCGA AGCGGTCCGC
CGCACCGGAA CCACCGAACT GCCGTTCGTG ATGCAGGTCC CCAGCCCCTA CTACACCTAT
GCCGTGCTGC TGGCCGCCGG CGGCGCGGTC TTCGGCCGAC GCGCGGACGG TGGCCTGGAC
ACCGGCAAAC TGGAGATCCT TTCCACCGCA ACCCGAGTCG CCCTCGACCG CTTCCGCGAC
CTGGGGCAGG CCGGACACCA GCACCTGCCC CCGAGGATCG GTCGCGAGGA GGCCATCGAC
CTGTTCGTCA GTGGACGGAC GCCATTCCTC ATCGGCACCT CGCGCGTACT GCTCGCCGCC
CAGAAAGCCA GACTGAACCT CGCGGTCGAC CCGGTCCCGT CATTCGACGG CGTCGACCCG
GTGCGCCCCA TGGTCTCCGT ACACGGTTTC TTCCTCACCC GATGCGGCCA CAACAAGGTC
ATCGCGCGAG ACCTCATCGT CGACCACCTC ACCCGCACCG AAGTCTCCAC CGCCCTGCAC
GAGGTCTGGC CCCACGTCCC GGCCCGACGC GACGCACTGG AACGCGGTCG CGACATCGAC
CCGGCGATCG GCGCGTTCTA CGAGGCGTTC CGCGCCGGAG ACCCGATGCC GTCGATACCC
CAGATGGGCG ACGTGTGGCA GTCCCTGCGC AGCGCCGTGA TCAAACTGGT CGACGGGGCC
GAGGCCGGGC CGGTCGCGCG CCAACTGTCC AAACAGCTCG AAGACCTGTT GAACGGCAAA
CACAACCGGG GGACACGATG A
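
The GC content field in the gene information can, in principle, be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over the whole sequence, and that the blanks and line breaks in the listing are only formatting. The function name `gc_content` is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace in the listing."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence, as an illustration only.
fragment = "GTGAACCTGA CCAGCCTCCC"
print(f"{gc_content(fragment):.0%}")
```

Passing the full 1641 bp sequence instead of the fragment gives a value that can be compared with the reported 68%, which is presumably rounded.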
 
Protein sequence
MNLTSLPDWL SSISSAAALL FAAIAAVAAR NVYKIESARD QANAALRAKQ DALERRDQAA 
LVSAWWGYSP DGGQQSQPGW GVFVRNASET PVYNAGFSVL DVRDPNVSER FDMVVIPPAA
EPVFHPSPLS ASKPADFRVE VTFTDSRGQR WIRDKQGRLH ELGPAVVVWG DESRINALRR
FFSDFLASHG VEIHARTGDI EDLRRALLDA DDATAAPDIV VGPHDWIGGL VEQQLIEPLT
LSPQHRAAFD ALSLEAMTYH GELYGIPYAF DAPVLLRNTD LVAEAPSSFE DMLHKGEAVR
RTGTTELPFV MQVPSPYYTY AVLLAAGGAV FGRRADGGLD TGKLEILSTA TRVALDRFRD
LGQAGHQHLP PRIGREEAID LFVSGRTPFL IGTSRVLLAA QKARLNLAVD PVPSFDGVDP
VRPMVSVHGF FLTRCGHNKV IARDLIVDHL TRTEVSTALH EVWPHVPARR DALERGRDID
PAIGAFYEAF RAGDPMPSIP QMGDVWQSLR SAVIKLVDGA EAGPVARQLS KQLEDLLNGK
HNRGTR
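
The protein sequence corresponds to a translation of the gene sequence with translation table 11 (the bacterial, archaeal and plant plastid code). Note that the gene starts with GTG, which normally encodes valine but is read as methionine when used as a start codon. The following is a minimal sketch of that translation. The names `translate_cds` and `START_CODONS` are hypothetical, and the start codon set is a commonly used subset assumed here, not the complete list for table 11.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a subset of the alternative start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # start codon is read as methionine
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACCTGA CCAGCCTCCC CGACTGGTTG"))  # MNLTSLPDWL
```

The output matches the first ten residues of the listed protein sequence. Translating the last 21 bases of the gene in the same way yields HNRGTR followed by the TGA stop codon, matching the end of the protein.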