Gene Snas_1828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1828 
Symbol 
ID8883019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1918274 
End bp1919641 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content63% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003510617 
Protein GI291299339 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.218215 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGTG CACGTGCCCT CCTCGCCCCC GCGCTGATCG CCGCCCTTCT CGGTGGAATG 
ACAGCCTGCG CCGGTGAAGA GAAGGACCCC GGTGAGATCG ACGTATGGAT CGGATTCGTC
GACCACCGTC TGGACTGGAT GAAGGACCGC GCCAAGGAGT TCGAGGACGA GCACCCCGGA
TACAAGGTCA ACATAACCCC GTACAAGGAC TACCCGACGC TGTGGGACAA ACTGACCGCC
GCGGCCGAGC AGGGTGAGCC GCCGACGATC GCGCAGAACT TCGAGGCCGC GACCCAGGAG
TCGCGCGACG CCGTGAACAG CGAGGGGGAG CCGCTGTTCG CCTCGGTGGA GAAGGAGATC
GACGGGCGCA AGGAGATCCT CGGTGAGAAG GTGGTGCTCG ACGACGTCAT CGACGCCACC
CGCAACTACT ACACTTTGGA CGGCGAGTTC GCGTCGATGC CGTGGAACAC CTCCACCCCG
GTGTTCTACT CCAACACCGA CATCCTGAAG AAGGCCAAGA TCAAGGAGGC CCCCAAGACC
TGGGAGGACC TCCAGGCCGC CTGCGACAAG ATCGACAAGA TGAAGGACGG CCCCAAGAAC
TGCATCACCT GGCCCAACCA GGCGTGGTTC CTGGAGCAGC CGCTGGCCGA GCAGGGCGGG
CTGTTGGTCA ACAAGGACAA CGGCCGTTCC GGCCGGGCCA CCAAGATCGA CCTGACCAGT
GACAAGTTCC TGGCCTGGGC GAAGGTGTGG GCCGACATGT CGAAGAAGAA GCAGTACTCG
TACTCCGGCA AGCAGGAGGA CTGGATCACC CCGACCAAGA ACTTCACCGG TCAGGAGGTC
GCGTTCATGA TGACCTCCTC GGCGGAGGCC TCGGTGGTCG CCAAGCAGGC CAAGGAATCC
GACTTCGGCT TCGAGGTCAC CAAGATGCCG CTGAAGAAGG GAGCCCCTTA CTCGGGCAAC
TTCATCGGCG GCGCGACACT GTGGATGACC GCGGGCCTGG AGAAGAAGAC CTCCGACGGC
GCGCTGGCCT TCATGCAGTA CATCAACAAC CCCGAGAACG CCGCCGACTG GCACAAGATC
ACCGGCTACG TCCCCGTGAC CAAGAGCGCC GAGGAACTCC TGGAGAAGGA GAAGTGGTTC
GACGACAACC CGCACCACAA GGTCGCCATC GAGCAGCTGG CCGCCACCGA CGGCTCCCCG
GCCGCCACCG GACCGATCGT CGGCAACTTC GTGGCCATCC GCAAGGAGAT GCAACAGGCC
ATGGAGGACA TCATGAACAA CGGTGATGAT CCGGCCGAAC GGTTCAAGGA AGCCGAGAAG
GCCTGCCAGA AGCTCCTGGA CGACTACAAC GAGCTCAGCG CGGGCTGA
 
Protein sequence
MRRARALLAP ALIAALLGGM TACAGEEKDP GEIDVWIGFV DHRLDWMKDR AKEFEDEHPG 
YKVNITPYKD YPTLWDKLTA AAEQGEPPTI AQNFEAATQE SRDAVNSEGE PLFASVEKEI
DGRKEILGEK VVLDDVIDAT RNYYTLDGEF ASMPWNTSTP VFYSNTDILK KAKIKEAPKT
WEDLQAACDK IDKMKDGPKN CITWPNQAWF LEQPLAEQGG LLVNKDNGRS GRATKIDLTS
DKFLAWAKVW ADMSKKKQYS YSGKQEDWIT PTKNFTGQEV AFMMTSSAEA SVVAKQAKES
DFGFEVTKMP LKKGAPYSGN FIGGATLWMT AGLEKKTSDG ALAFMQYINN PENAADWHKI
TGYVPVTKSA EELLEKEKWF DDNPHHKVAI EQLAATDGSP AATGPIVGNF VAIRKEMQQA
MEDIMNNGDD PAERFKEAEK ACQKLLDDYN ELSAG