Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Snas_1410 |
Symbol | |
ID | 8882597 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 1493839 |
End bp | 1495287 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003510210 |
Protein GI | 291298932 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACACA AGAAGCCCGC GTCCGGCTAT CGGCAACCGG TTGCTCGCGC GGTCTCGGCC GCCACCAGCT TCGGCCTGGT CGGCGTGCTG GCCGCCGGGT GTCTGGGCGG CGGAAACGAC GCCGCGACCG ACCCGAACAA GAACGCCGAC GCCAAGGAGT ACACGCTCAC GATCACCTCC AACGCCATCG CCGACGGCAA GAACGCGATC GGCGCCAAGT GGATCGAGGA GTGGGTAATC CCGCAGTTCG AGAAGGCCCA GAAGAAGAAG GGCATCACCG CGAAGGTGAA GTTCGAGCCG CAGGGCGTCG ACGACGCCAA GTACAAGTCC AAGATCGGGC TCGACCTGGA CTCGGGCAAG GGCGCCGACG TCATCGACAT CGACGGCATC TGGGTGGGCG AGTTCGCCGA GTCGGAGTAC ATCCTGCCGC TGGAGAAGGT CGTCGGCGCC GACAGCATGG AGAAGTGGGA CGGCTGGAAG CAGATCCCCG ACAACGTCGA GGCCAACGGC ACCTACAAGG GCGACAAGTA CGGCGTGCCG AAGGGCACCG ACGGACGGGT CGTGTTCTAC AACAAGAACG TGTTCAAGAA GGCGGGACTG CCGGGTGACT GGCAGCCGAA GAGCTGGGCC GACATCATCG ACGCGGCCGA GAAGATCAAG AAGAAGGCCA AGGGCGTGAC CCCGTTGCAG ATCAACGCGG GCACCGCGAT GGGCGAGTCC ACCACGATGG AGGCGTTCCT GCCGCTGCTG GCGGGCACCG GCAACGAGAT CTTCCAGGAC GGCAAGTGGC AGGGCGACAC CGACGCCATC CGCGACGTCC TGGGCGTCTA CGAGGACACC TACCAGGGCG GCCTGGGCGA CGCGACGCTG CAGAAGGAGG CGCAGGGTCG GCAGAAGGCC CAGGAGCGGT TCTCCAAGGA CAAGGTCGGG ATCATGATGG AGGGCGACTA CTTCTGGCGT GACGTCGTCT CGCCCGGTTC CAGCGTCGCC CCGATGAAGA ACCGCGACTC CGATGTCGGG TTCGCCAAGA TCCCGTCGAT GAAACCCGGT TCGGGTGTGG ACGGTCAGGA CTTCGTGTCG ATGTCCGGCG GCGGCACCCA GGTGATCAAC CCCAACACCA AGTACCCGCA GCAGGCGTGG GAACTGATGC AGTTCATGGG CTCGGCCAAG GCCGTGAAGG AAGAGGTCGG CGACACGCCG CGCATCACCC AGCGCGAGGA CGTCAACTCC GACATCCTGG CCGACGACCC GCTGTTGTCC TTCATCGCCG AGGACGTGGT GCCGGTGACC CGGTTCCGTC CCTCCGACGG CAAGTACGTG AAGGTCTCGG AGGCGTTGCA GAAGGCGACC TACGCGGTCG TCGAGGGCAA GTCCGCGGCC GAGGCGGCCA AGGAATACCA GAAGGCTCTT GAGGACATCG TCGGTGCAGA CAAAGTCTCC GGAAGCTGA
|
Protein sequence | MRHKKPASGY RQPVARAVSA ATSFGLVGVL AAGCLGGGND AATDPNKNAD AKEYTLTITS NAIADGKNAI GAKWIEEWVI PQFEKAQKKK GITAKVKFEP QGVDDAKYKS KIGLDLDSGK GADVIDIDGI WVGEFAESEY ILPLEKVVGA DSMEKWDGWK QIPDNVEANG TYKGDKYGVP KGTDGRVVFY NKNVFKKAGL PGDWQPKSWA DIIDAAEKIK KKAKGVTPLQ INAGTAMGES TTMEAFLPLL AGTGNEIFQD GKWQGDTDAI RDVLGVYEDT YQGGLGDATL QKEAQGRQKA QERFSKDKVG IMMEGDYFWR DVVSPGSSVA PMKNRDSDVG FAKIPSMKPG SGVDGQDFVS MSGGGTQVIN PNTKYPQQAW ELMQFMGSAK AVKEEVGDTP RITQREDVNS DILADDPLLS FIAEDVVPVT RFRPSDGKYV KVSEALQKAT YAVVEGKSAA EAAKEYQKAL EDIVGADKVS GS
|
| |