Gene Information |
Locus tag | Snas_6149 |
Symbol | |
ID | 8887371 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 6507528 |
End bp | 6509228 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003514865 |
Protein GI | 291303587 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
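The coordinate, gene length, and protein length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(6507528, 6509228)
print(length)                  # 1701
print(protein_length(length))  # 566
```

Both results agree with the Gene Length (1701 bp) and Protein Length (566 aa) values listed in the record.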
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.784749 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAACA CCAGATCCAA ACCCGTACGC CTGCTCGCCG TACTGACGGC CGCTGGCGTC GCCGCCGCGC TATCGGCCTG TGCCGCCGAT CCCGGCGCCG CCAAGGACAA CGACAAGACC GCGTTCAAGT TCGCCACCGC CAGCGAACCG ACCTCTTTGG ACCCGGCGCT CGCCTCCGAC GGTGAGACCT TCCGGGTCAA CCGCCAGGTC GTGGAGACCC TGGTGGAGCA CAAGACCGGT GGCGACGAGC TGGTTCCCGG GCTCGCCAAG GAGTACTCGC CCAGCAAGGA CGGCCGGACC TGGAACTTCA CCCTCAACGA GGGCGTCAAG TTCCACGACG GCGACGACCT GACCGCCGAA GCCGTGTGCG CCAACTTCGA CCGTTGGTAC AACTGGAAGG GCGTCTACCA GAACCCGGCG CTGTCGGGCT ACTGGCAGGA CATCATGGGC GGCTTCGCCA AGAACGAGAA CAAGGACCTG CCCAAGTCCA ACTACGACGG CTGTAAGACC GACGGCGAGT TCAAGCTGTC CATCAAGGTC AAGGAACCCA CCGCGAAACT GCCGGGCGGC TTCTCACTGT CGTCGCTGGG CATCCTGAGC CCGAAGACGC TGGCCGCGGC CGACAAGCAG CAGCCCAAGC AAGAGGGCGA GTCCATCAAG TACCCCGACT ACAGCCAGGA GGTCGGCACC ATCGCCGGCA CCGGGCCGTA CGAGTACTCG AAGTGGGACA AGAGCCAGCA GGAAGTCACC ATCAAGGCCA ACAAGGACTA CTGGGGCAAG ACCAACGCCA AGATCAAGAC CATCATCTTC AAGGCCATCT CCGAGGAGAA CGACCGCAAG TCGGCCCTGA TCTCCGGCGA CGTCGACGGT TACGACCTGG TGGCGCCGCA GGACATCGAC GACCTGAAGA AGAAGGACAT GAACGTCCTC ACCCGGGATC CGTTCAACAT CTTCTACATC GGTCTGAACC AGAAGCTCGT CGACCTGAAG ACGAACAAGG GCAAGAAGAC CGTCTTCGCC GACAAGGACG TCCGCAAGGC CATCGCGCAC TCCATCAACA AGGACAAGAT CATCAAGCAG ATCTATCCGA AGGGGACCGA GGCCGCCACC CAGTTCCAGC CGCCGTCGCT GGACGGCTGG TCGGACAACG TGCCGAAGTA CGAGTACGAC AAGGACAAGG CCAAGGCGCT GCTCAAGAAG GCCGGGCAGT CGGACATGAA GATCGACTTC TGCTACCCGA CCAAGACCAC GCGGCCCTAC ATGCCGGACC CGAAGTCCAT CTTCGACAAC ATGAAGTCGG ACCTGGAGGC CGTCGGCATC ACCGTCGAGG AGAAGCCGCT GCAGTGGTCG CCGACCTACG GTGACCAGAC CTCCGCCGGT GGTTGCAGCA TGTACATCCT GGGCTGGACC GCCGACTACG CCGAGGCGTT CAACTTCAAC GGCACCTGGT TCTCGCAGTA CACCCCGGCC TGGGGCTTCA AGGACGACAA GGTCTTCGAC GCGCTGGCCA AGGCCAACGC CGAAGCCGAC CCCGCCGAAC GCGCCAAGCT GCACCAGAAG GCGAACGAGG CCATCATGGA CTACGTCCCC GGTGTCCCGA TCTCGCATTC CTCGCCGTCC ATCGCCTTCG CCGACTACGT GAAGGCTCCG ACCCTGTCCC CGCTGACTCA GGAAAACTTC GCTGAGACCA GCTTCAAGTA A
|
Protein sequence | MRNTRSKPVR LLAVLTAAGV AAALSACAAD PGAAKDNDKT AFKFATASEP TSLDPALASD GETFRVNRQV VETLVEHKTG GDELVPGLAK EYSPSKDGRT WNFTLNEGVK FHDGDDLTAE AVCANFDRWY NWKGVYQNPA LSGYWQDIMG GFAKNENKDL PKSNYDGCKT DGEFKLSIKV KEPTAKLPGG FSLSSLGILS PKTLAAADKQ QPKQEGESIK YPDYSQEVGT IAGTGPYEYS KWDKSQQEVT IKANKDYWGK TNAKIKTIIF KAISEENDRK SALISGDVDG YDLVAPQDID DLKKKDMNVL TRDPFNIFYI GLNQKLVDLK TNKGKKTVFA DKDVRKAIAH SINKDKIIKQ IYPKGTEAAT QFQPPSLDGW SDNVPKYEYD KDKAKALLKK AGQSDMKIDF CYPTKTTRPY MPDPKSIFDN MKSDLEAVGI TVEEKPLQWS PTYGDQTSAG GCSMYILGWT ADYAEAFNFN GTWFSQYTPA WGFKDDKVFD ALAKANAEAD PAERAKLHQK ANEAIMDYVP GVPISHSSPS IAFADYVKAP TLSPLTQENF AETSFK
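The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names (clean_sequence, gc_percent, translate) are hypothetical. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which start codons they permit), and it assumes the input contains only A, C, G, and T.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and any line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing incomplete codon is ignored.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
fragment = clean_sequence("ATGCGAAACA CCAGATCCAA ACCCGTACGC")
print(translate(fragment))  # MRNTRSKPVR
```

The translated fragment matches the first ten residues of the protein sequence. Applying gc_percent to the full cleaned gene sequence would give a value to compare with the GC content field.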