Gene Information |
Locus tag | Snas_1142 |
Symbol | |
ID | 8882328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Stackebrandtia nassauensis DSM 44728 |
Kingdom | Bacteria |
Replicon accession | NC_013947 |
Strand | + |
Start bp | 1220727 |
End bp | 1222577 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003509945 |
Protein GI | 291298667 |
COG category | |
COG ID | |
TIGRFAM ID | |
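The coordinates and lengths listed above can be cross-checked against each other. The sketch below is only an illustration: it assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and the values are copied from the table.

```python
# Values copied from the Gene Information table.
start_bp = 1220727
end_bp = 1222577
gene_length_bp = 1851
protein_length_aa = 616

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the listed values: 1851 bp corresponds to 617 codons, that is 616 residues plus the stop codon.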
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.163498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.236826 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACT CAAGTTTCAC GGTGGCGTCG ACGAGCGTGT CCCGTCGCCG GCTGTTCCAG GCGGCGGGTC TCGGCGCCGC GGGTGCGGCC GGACTGGGAA CCATGACCGC CTGCAAGGCC GACCCGGGTA TCCAGGGCAA AGGCGAGTTC CACGGCGGCT ACCCCTACGA GACCCCGCCG GACGGCCACT TCAACACCGC CGGAGCCCCG TACGCGGTGG TGCCGCACGT GTTCGTCGAG GGCATGTACC TGGACCTGAC CTGTATGCCC GGCGGTTACT ACTGGTGGGA CAAGCAGGAA TGGGAGTACT TCCTGGCCGA GAGCTTCGAG CTCGACGACA AGGAGAACAC CTTCACCCTC AAGGTGCGTG ACGGTCTGAA GTGGAGCGAC GGGGAGCCGC TGACCGCCAA GGACTTCGAG ACCACCTACT GGTTGTGCTG GATCCGCAGC AACCCGATGT GGAAGTCGCT CGACAGCCTC AAGGCCACCG ACGACATGAC CATCGAGGGC AAGCTGAGCA ACCCGTCCTC GGTCATCGAG CGCTACATGC TCAAGACCAA CGTGGCGCCC AGCCACAACA AGGACTCCAA GATGGGCAAG ACCTACCGGG ACTTCGCCGA GGCGGCGATG AAACTGCACG AGGACGGCAA GGACCAGACC TCCAAGGAGG GCGAGAAGCT CGGCGCCGAC CTGGCCAAGT TCCGGCCGGA GAACCTGCTG ACCTCGGGTC CGTTCAAACT GGAGAAGAAG GACTTCACCA AGACCCAGAT GGTGTTGACC AAGAACAAGA ACGGCTTCAA CGCCGACAAG GTCAAGTTCG ACAAGGTCGT CGTCTACGAG GGCGAACTGC CGCAGATCAC GCCGCTGGTC AAGGACAAGT CCGTCGACTA CGCCTCCCAC GGTTTCGCGC CCAACCAGGA GAAGAAGTTC AAGCGCGACG GTCACAAGAT CGTCCGGCCG CCGGTGTACT CCGGGCCGTC GCTGTACATC AACTTCAAGG AGGTGCCCGA GTTCAAGGAC GTGCTGACCC GGCGCGCCAT CGCGCACGCC ATCAACCGCG CCGACGCGGG CAGTATCGCC CTGGGCGACT CCGGTCCGGC CGTGAAGTAC ATGGCGGGCT TCTCCGACAT CATGGTCCCC GACTGGATCT CCAAGGAGGA CCAGGACGCC TTCGACACCT ACGAGCACGA CCTCGACAAG GCCGCCGAGC TCATGGAGAA GGCGGGCTGG AAGAAGGACG GTGACGTGTG GGCCAGGGGC GACAAGAAGA TGGACTACGA GATCAAGTGG CCCTCCACCT ACGCCGACTG GTCGGCCTGC GGTGACGCCA TCGTGGACCA GCTGACCGAC TTCGGCATCA AGCTGACCGC GCAGCCCGTC GACGAGGAGC AGTACCTCGA GGAGATCGAC AAGGGCGAGT TCCAGATGGC CATCAACGTC TGGGGCTCCT CGCAGCACCC GCACCCGCAC TTCGCGTTCG TCGCCGACCT GTTCACCCAC AACACCCCGA TCGCCAAGAA CAACGGCGGC GACGGCATCG CCTTCGACCT GAAGGTGAAG TCCAAGAAGC ACGGCGAGGT GGACCTGGAG GAACTGGTCC TCAAGGCCGG GCAGGGACTG GACGAGAAGG AGCAGAAGGC CAACGTCACC AAGGTGGCGC AGGTGTTCAA CGAACTGCTG CCGATCATCC CGATCTGCGA GCGGTACTCC AACAGCCCGA TCCTGGAGGG CGAGGGCAAC CGGGTCAAGG ACTTCCCCGA CGAGGACGAC CCGATCTACA AGAACTCGCC CTACGCCGAC 
AACCCGATCA CCCTGGGAAT CGTGACCGGC AAGATCACTC CCAACGACTA A
|
Protein sequence | MTDSSFTVAS TSVSRRRLFQ AAGLGAAGAA GLGTMTACKA DPGIQGKGEF HGGYPYETPP DGHFNTAGAP YAVVPHVFVE GMYLDLTCMP GGYYWWDKQE WEYFLAESFE LDDKENTFTL KVRDGLKWSD GEPLTAKDFE TTYWLCWIRS NPMWKSLDSL KATDDMTIEG KLSNPSSVIE RYMLKTNVAP SHNKDSKMGK TYRDFAEAAM KLHEDGKDQT SKEGEKLGAD LAKFRPENLL TSGPFKLEKK DFTKTQMVLT KNKNGFNADK VKFDKVVVYE GELPQITPLV KDKSVDYASH GFAPNQEKKF KRDGHKIVRP PVYSGPSLYI NFKEVPEFKD VLTRRAIAHA INRADAGSIA LGDSGPAVKY MAGFSDIMVP DWISKEDQDA FDTYEHDLDK AAELMEKAGW KKDGDVWARG DKKMDYEIKW PSTYADWSAC GDAIVDQLTD FGIKLTAQPV DEEQYLEEID KGEFQMAINV WGSSQHPHPH FAFVADLFTH NTPIAKNNGG DGIAFDLKVK SKKHGEVDLE ELVLKAGQGL DEKEQKANVT KVAQVFNELL PIIPICERYS NSPILEGEGN RVKDFPDEDD PIYKNSPYAD NPITLGIVTG KITPND
|