Gene Snas_1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1410 
Symbol 
ID8882597 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1493839 
End bp1495287 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content65% 
IMG OID 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003510210 
Protein GI291298932 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACACA AGAAGCCCGC GTCCGGCTAT CGGCAACCGG TTGCTCGCGC GGTCTCGGCC 
GCCACCAGCT TCGGCCTGGT CGGCGTGCTG GCCGCCGGGT GTCTGGGCGG CGGAAACGAC
GCCGCGACCG ACCCGAACAA GAACGCCGAC GCCAAGGAGT ACACGCTCAC GATCACCTCC
AACGCCATCG CCGACGGCAA GAACGCGATC GGCGCCAAGT GGATCGAGGA GTGGGTAATC
CCGCAGTTCG AGAAGGCCCA GAAGAAGAAG GGCATCACCG CGAAGGTGAA GTTCGAGCCG
CAGGGCGTCG ACGACGCCAA GTACAAGTCC AAGATCGGGC TCGACCTGGA CTCGGGCAAG
GGCGCCGACG TCATCGACAT CGACGGCATC TGGGTGGGCG AGTTCGCCGA GTCGGAGTAC
ATCCTGCCGC TGGAGAAGGT CGTCGGCGCC GACAGCATGG AGAAGTGGGA CGGCTGGAAG
CAGATCCCCG ACAACGTCGA GGCCAACGGC ACCTACAAGG GCGACAAGTA CGGCGTGCCG
AAGGGCACCG ACGGACGGGT CGTGTTCTAC AACAAGAACG TGTTCAAGAA GGCGGGACTG
CCGGGTGACT GGCAGCCGAA GAGCTGGGCC GACATCATCG ACGCGGCCGA GAAGATCAAG
AAGAAGGCCA AGGGCGTGAC CCCGTTGCAG ATCAACGCGG GCACCGCGAT GGGCGAGTCC
ACCACGATGG AGGCGTTCCT GCCGCTGCTG GCGGGCACCG GCAACGAGAT CTTCCAGGAC
GGCAAGTGGC AGGGCGACAC CGACGCCATC CGCGACGTCC TGGGCGTCTA CGAGGACACC
TACCAGGGCG GCCTGGGCGA CGCGACGCTG CAGAAGGAGG CGCAGGGTCG GCAGAAGGCC
CAGGAGCGGT TCTCCAAGGA CAAGGTCGGG ATCATGATGG AGGGCGACTA CTTCTGGCGT
GACGTCGTCT CGCCCGGTTC CAGCGTCGCC CCGATGAAGA ACCGCGACTC CGATGTCGGG
TTCGCCAAGA TCCCGTCGAT GAAACCCGGT TCGGGTGTGG ACGGTCAGGA CTTCGTGTCG
ATGTCCGGCG GCGGCACCCA GGTGATCAAC CCCAACACCA AGTACCCGCA GCAGGCGTGG
GAACTGATGC AGTTCATGGG CTCGGCCAAG GCCGTGAAGG AAGAGGTCGG CGACACGCCG
CGCATCACCC AGCGCGAGGA CGTCAACTCC GACATCCTGG CCGACGACCC GCTGTTGTCC
TTCATCGCCG AGGACGTGGT GCCGGTGACC CGGTTCCGTC CCTCCGACGG CAAGTACGTG
AAGGTCTCGG AGGCGTTGCA GAAGGCGACC TACGCGGTCG TCGAGGGCAA GTCCGCGGCC
GAGGCGGCCA AGGAATACCA GAAGGCTCTT GAGGACATCG TCGGTGCAGA CAAAGTCTCC
GGAAGCTGA
 
Protein sequence
MRHKKPASGY RQPVARAVSA ATSFGLVGVL AAGCLGGGND AATDPNKNAD AKEYTLTITS 
NAIADGKNAI GAKWIEEWVI PQFEKAQKKK GITAKVKFEP QGVDDAKYKS KIGLDLDSGK
GADVIDIDGI WVGEFAESEY ILPLEKVVGA DSMEKWDGWK QIPDNVEANG TYKGDKYGVP
KGTDGRVVFY NKNVFKKAGL PGDWQPKSWA DIIDAAEKIK KKAKGVTPLQ INAGTAMGES
TTMEAFLPLL AGTGNEIFQD GKWQGDTDAI RDVLGVYEDT YQGGLGDATL QKEAQGRQKA
QERFSKDKVG IMMEGDYFWR DVVSPGSSVA PMKNRDSDVG FAKIPSMKPG SGVDGQDFVS
MSGGGTQVIN PNTKYPQQAW ELMQFMGSAK AVKEEVGDTP RITQREDVNS DILADDPLLS
FIAEDVVPVT RFRPSDGKYV KVSEALQKAT YAVVEGKSAA EAAKEYQKAL EDIVGADKVS
GS