Gene Information |
Locus tag | Snas_6282 |
Symbol | |
ID | 8887506 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | - |
Start bp | 6625447 |
End bp | 6626748 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003514995 |
Protein GI | 291303717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
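The coordinate and length fields above are related by simple arithmetic, sketched below. The sketch assumes that the start and end coordinates are 1-based and inclusive, and that the gene length counts the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record for Snas_6282
start_bp, end_bp = 6625447, 6626748

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1302
print(protein_length(length_bp))  # 433
```

Because the gene lies on the minus strand, the coding sequence reads from the end coordinate toward the start coordinate on the replicon; the length calculation is unaffected.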
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAGTTT CGCGGCATCG CGGCCGAGTG GCCGCACTGA CAGCCCTCGC CGCCGTGGCC GTGTCCGGGT TGACGGCCTG TAGCGGCGAC GACCGCATCA AACTGACCGT CCAGGTGTTC GGCGGCGCCG GGTTCGGATA CGAGGACCTC GTCAAGGAGT ACGAGAAGGA CCACCCGGAC ATCGACGTCG ACTACCAGAT CGTCACCGAC GACTACGACA ACGAGTTCCG CCCGCAGGTG TTGCAGTGGC TGGAGGCCGG AAGCGGCGCC GGTGACGTCG TCGGCATCGA GGAGCAGGGC GTCGGGCAGA TGATGTCGCT GGGCGACGCG TGGGCCGATC TGTCCGAGTA CGGGCTGGAC AAGCGTGAGT CCGACTATCC GTCGTGGAAA TGGGAACAGG GGCACACCGC CGACGGCAAG CTGGCCGGGC TCGGCACCGA CGTCGGCGGC ATGGCGATGT GTTACCGCAC CGACCTGTTC AAGAAGGCCG GGCTGCCCAC CGACCGCGAG AAGGTAGCCG AGCTGTGGCA GGACTGGGAC GGTTTCACCA AGGCCGCGAA GAAGTTCACC GACTCTGATG TGGACGCTGC GTTCGTCGAC AGTCCCAACC AGCTCTACAA CATCCGCATG GTCCAGGAAG CCGGTGCCGC CGACGGCATC AGCTACTTCG ACCGCAAGGA CAAGTACGTC GCCGGTGACA CCGAGGCCGT GCGCACCGCC TTCGACTACG TGGCCGAGCT GCACGAACTC GGCGCCGTCG GCCAGTTCGA GAACTGGTCC GACGAATGGA ACTCCGCCAT GCAGGCGGGC GGCTTCGCCA CCATGGGCTG TCCCGCGTGG ATGATGGGCG TCATCGCCGA CACCTCCGGC AAGGAGAACA AGGGCAACTG GGACGTGGCC CAGGTGCCCG GCGGCAGCGG CAACTGGGGC GGTTCCTGGC TGGGCGTGCC CGCGCAGAGC GACCACCCCA AGGAGGCCGC CGAACTGGCC GACTACCTGA CCAAGCCGAA GTCCCAGGTC GCCGCCTTCG AGGCCATCAG CGCCTTCCCC AGCACCAAGG AGGGCCAGGA GGATCCCAAG GTCGCCGACC TGTCCAATGA GTACTTCAAC GACGCGCCGA CCGGCAAGCT GATCGCCGAC TCGGTCAAGG AGTTCAAGCC GGTCTACTAC GGCGAACTGC ACTCCGCGGT GCGCGCCGCC GTCGAGGACG TCCTGTTCGG TCTCGTCCAG GGCAGCTACA AACACGATGA GGCGTGGAAG GAGTTCGTGG CCGCGGGCCA GGAAGTGGTG GACACGGCGT GA
|
Protein sequence | MRVSRHRGRV AALTALAAVA VSGLTACSGD DRIKLTVQVF GGAGFGYEDL VKEYEKDHPD IDVDYQIVTD DYDNEFRPQV LQWLEAGSGA GDVVGIEEQG VGQMMSLGDA WADLSEYGLD KRESDYPSWK WEQGHTADGK LAGLGTDVGG MAMCYRTDLF KKAGLPTDRE KVAELWQDWD GFTKAAKKFT DSDVDAAFVD SPNQLYNIRM VQEAGAADGI SYFDRKDKYV AGDTEAVRTA FDYVAELHEL GAVGQFENWS DEWNSAMQAG GFATMGCPAW MMGVIADTSG KENKGNWDVA QVPGGSGNWG GSWLGVPAQS DHPKEAAELA DYLTKPKSQV AAFEAISAFP STKEGQEDPK VADLSNEYFN DAPTGKLIAD SVKEFKPVYY GELHSAVRAA VEDVLFGLVQ GSYKHDEAWK EFVAAGQEVV DTA
|
| |
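As a rough cross-check of the sequence fields, the gene sequence can be translated and its GC content computed with a short script. This is a minimal sketch, not the database's own method: it assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TGA), and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering. Translation table 11
# shares these amino-acid assignments with the standard code; it differs in
# which codons may act as start codons, which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases and last 12 bases of the gene sequence in this record
print(translate("ATGAGAGTTT CGCGGCATCG CGGCCGAGTG"))  # MRVSRHRGRV
print(translate("GACACGGCGT GA"))                     # DTA
```

Passing the full gene sequence to `translate` and `gc_percent` should allow a comparison against the protein sequence and the GC content listed in the record.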