Gene Snas_4693 details


Gene Information

Locus tag: Snas_4693
Symbol:
ID: 8885899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Stackebrandtia nassauensis DSM 44728
Kingdom: Bacteria
Replicon accession: NC_013947
Strand:
Start bp: 5002431
End bp: 5003804
Gene Length: 1374 bp
Protein Length: 457 aa
Translation table: 11
GC content: 66%
IMG OID:
Product: extracellular solute-binding protein family 1
Protein accession: YP_003513429
Protein GI: 291302151
COG category:
COG ID:
TIGRFAM ID:
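
The listed coordinates and lengths can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 5002431
end_bp = 5003804

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS should span a whole number of codons; the last codon is the stop codon.
assert gene_length % 3 == 0
protein_length = gene_length // 3 - 1

print(gene_length)     # 1374
print(protein_length)  # 457
```

Both values agree with the "Gene Length" and "Protein Length" fields.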


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.363745
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTTCAG CAAAGCCACT TGGACCCGTG TTGGTCCTGT TGACCGCGAC GGTCCTCGTC 
TCCGGTTGTT CCGGCAGCGA CGACTACGGC GAGGACGGAC GGCTTCAGGT CGAGGTCGCC
ATGGACGCGG GCCTGGAGAA GAGCGCCAAG AAGGTGCTCG ACGAACGGGT CAAGCACTTC
GAGAAGGCCA ATGAGGACAT CGACATCATC CCCCAGGAGT ACACCTGGGA AGCGACCACG
TTCACCGCGC AGCTCGCCGG TGACACGCTT CCCGATGTCT TCACCGCGCC GTTCACCGAC
GGCCGCGGCC TGATCGAACG CAAGCAGATC GCCGACATCA GCGCGCTGGT CGCCGACCTG
CCCTACGCCG ACAAGTTCAA CCCGGGGATC GCCAAGGCCG GTTCGGATGC CAAGGACCGG
ATCTGGGCGG TACCGGTCTC GGCCTACGGC CAGGCGCTGC ACTACAACCG CGCCCTGTTC
GACGAGGCCG GGCTCGATCC GGACAAGCCG CCCACGACCT GGAAGGAGGT CCGCGAGGCA
GCCAAGAAGA TCGCCGACGA GACCGGCGAG GCCGGGTACG TCCAGATGAC CAAGGACAAC
ACCGGCGGCT GGATCCTGAC CACACTGGAC AACGCCCTCG GCGGCCGGGT CGAGGAGCTC
GACGGCGACA AGGCCACGTC CACCATCAAC ACACCACAGA TGGTGGAGGC GCTGGAGTTG
TTGCGGGACA TGCGCTGGAA GGACGACAGC ATGGGCGACA ACTTCCTGCA CGACTGGGCC
GGGTCCAACC AGGACTTCGC GGCCGGGCGG ATCGGCATGT ACATCACCGG CGGCGGCAAC
TACGGGCAGC TGATGGCGCA GAACGACATC AAGCCGGACG ACTACGGCGT GACGGTGGTG
CCGCTGTCGG ACTCCCCCGA CGCCGGGGTA CTGGGCGGCG GGACGCTGGC GGCGGTGAAC
GCCTCCGCCA GCGAGGAGGT CAAGGCGGCG GCGGTGAAGT GGATCGACTT CTACTACATG
GAGAAGCTCA CCGACGCCAA GGCCGCCAAG CTGGACGCCA AGACCACCGC CGAGTCCGGG
CAGGCCGTGG GGGCGCCGCT GCTGCCGGTC TTCGACAAGA AGACCTACGA CAAGCAGCAG
GAGTGGATCG CCGACTACAT CAACGTGCCG GTGGACCAGA TGAAGCCCTA CACCGACAAC
ATGTTCGACC AGCCGTTGGC GACCGAACCG ACGAAGTCCA CCCAGGAGGT CTACGGCGTC
ATGGACACGG TGGTGCAGTC GGTGCTGACC GAAGAGGACG CCGACATCGA CAAGCTGCTG
GACACCGCCG AGAAAGAGGC GCAGGCACTG CTCGACAAGG CCGCGAAGAA GTGA
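
The gene sequence is laid out in blocks of 10 bases, so the spaces and line breaks have to be stripped before any computation. The following is a minimal sketch of how the sequence could be cleaned and a GC percentage calculated. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the record does not state how its "GC content" field was computed or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGCTTTCAG CAAAGCCACT TGGACCCGTG TTGGTCCTGT TGACCGCGAC GGTCCTCGTC"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence through the same two functions gives a figure that can be compared with the "GC content" field.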
 
Protein sequence
MLSAKPLGPV LVLLTATVLV SGCSGSDDYG EDGRLQVEVA MDAGLEKSAK KVLDERVKHF 
EKANEDIDII PQEYTWEATT FTAQLAGDTL PDVFTAPFTD GRGLIERKQI ADISALVADL
PYADKFNPGI AKAGSDAKDR IWAVPVSAYG QALHYNRALF DEAGLDPDKP PTTWKEVREA
AKKIADETGE AGYVQMTKDN TGGWILTTLD NALGGRVEEL DGDKATSTIN TPQMVEALEL
LRDMRWKDDS MGDNFLHDWA GSNQDFAAGR IGMYITGGGN YGQLMAQNDI KPDDYGVTVV
PLSDSPDAGV LGGGTLAAVN ASASEEVKAA AVKWIDFYYM EKLTDAKAAK LDAKTTAESG
QAVGAPLLPV FDKKTYDKQQ EWIADYINVP VDQMKPYTDN MFDQPLATEP TKSTQEVYGV
MDTVVQSVLT EEDADIDKLL DTAEKEAQAL LDKAAKK
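
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which codons may act as start codons, a distinction this sketch ignores (the gene begins with ATG in any case). The `translate` helper is hypothetical and is applied only to the first and last lines of the gene sequence. Translating lines separately works here because each full line holds 60 bases, a multiple of three.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna_text: str) -> str:
    """Translate block-formatted DNA, stopping at the first stop codon."""
    seq = "".join(dna_text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCTTTCAG CAAAGCCACT TGGACCCGTG TTGGTCCTGT TGACCGCGAC GGTCCTCGTC"
last_line = "GACACCGCCG AGAAAGAGGC GCAGGCACTG CTCGACAAGG CCGCGAAGAA GTGA"

print(translate(first_line))  # MLSAKPLGPVLVLLTATVLV
print(translate(last_line))   # DTAEKEAQALLDKAAKK
```

The two outputs match the first 20 and last 17 residues of the listed protein sequence.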