Gene Information |
Locus tag | Sterm_4066 |
Symbol | |
ID | 8599510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | + |
Start bp | 4329232 |
End bp | 4331004 |
Gene Length | 1773 bp |
Protein Length | 590 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | glycoside hydrolase family 2 sugar binding protein |
Protein accession | YP_003310829 |
Protein GI | 269122652 |
COG category | |
COG ID | |
TIGRFAM ID | |
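The coordinate and length fields in this record can be cross-checked against each other. The minimal Python sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4329232, 4331004)
print(length_bp, protein_length(length_bp))  # 1773 590
```

Under these assumptions the result agrees with the Gene Length (1773 bp) and Protein Length (590 aa) fields.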
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAGAA ATGAATATCC CAGACCGGAT TTTGTGAGGG AACATTGGGC TTGTCTAAAC GGTATATGGG ATTTTGAATT TGATGATAAT AATATCGGAA TGTCATCAAA ATGGTATAAA AAAGATCATA AGCTTACTGA AAAAATCAAT GTTCCTTTTG TTTTTCAATC AAAGCTAAGT AATATCAATA TAAATGATTT TCACGATTTT ATCTGGTATA AAAGAGAATT TACAATAGAT GATTCCTGGA AAAATAAAGA TATTTTACTG CATTTCGGAG CCGTGGATTA CAGATGTCAT GTTTTTATCA ACGGAGAGCT GGCAGGAAGC CACGAAGGCG GACACACTTC CTTTTATCTG AATATCACCA ATTATCTTAC ATGGAATAAA GAGGAAATAA CTGTATATGT AGAAGATCCT TCTGATGATG AGACTATCCC AAGGGGAAAG CAATACTGGC TTGAAAATCC CGAGAGTATT TGGTATAAGC GTTCCAGCGG CATCTGGCAG TCTGTCTGGA TTGAACCTGT TAATAAAAAC TATATCACAG ACTTTAGATG TACTCCTTTA TTTGATCAGG GTTCTGTAGA ATTTAACATA AAAACAAAAT CTGCCAAAAA AAATACAAAA ATCATGATAC AAATCTCATT TAGGGATACC CTGATAGCTG AAGATATAAT AACTGTCAAT AATATTGAAA CTAAACGTAT CTACGATATT TTTCAAAAGA AAATTTTCAG AGGCTGCACG CATGGTTCCG GATGGACATG GACTCCTGAA AATCCTAATT TATTTGATGT TACACTTACT CTTCTTACCA ATGACAAAGT CTCAGATAAA ATAGAAAGTT ATTTTGGAAT GAGAAAAATA CATACTGAAA ACGGAAAAGT ATACCTGAAT AACAGACCTT ATTATCAAAG ACTGGTACTG GATCAGGGCT ACTGGCCCGA CAGCCTTATG ACAGCCCCTT CTGATGAGGA TTTTAAAAAA GATATTATAC TGGCAAAACA GATGGGCTTT AACGGATGCA GAAAACATCA AAAGATAGAA GACCGGCGTT TTCTTTACTG GGCTGATAAA CTCGGCTATC TTGTATGGAG CGAGATGCCC AGCACTATCT CATATGATTC GAATTCTGTT TCCAGAATTA CAAATGAATG GATAGAATCC GTAGACAGAG ATTATAATCA TCCCTGTATT GTTACATGGG TTGCATTAAA TGAAAGCTGG GGTGTTTCGG AAATTAATTA TAATAAAATG CAGCAAAGCC ACTCACTTTC TTTATATTAT ATGCTTCACT CGCTTGACAA TACCAGACCG GTTATTGCAA ATGACGGATG GGAAGCTACA AAAACTGATA TTTGCGCTGT ACATAATTAC CAACATGGTA CAAAAGATGA GAAAGAAAAA TATGAGAAAT TTATCAAAGA TTTGAGTACA AAAGAAGACA TACTGGACTC TGTACCGGCG GGGCGTAATA TTTATGCTGA CGAATTCGAA TATACCGGCG AGCCTGTTAT GCTCACAGAA TTCGGCGGGA TCGGCTATGA TAAAATCCGT CCTGACGGCT GGGGATACAC AGTTGCTTCC AGTGAAGCAG AATTTATTCA TGATTTAGAG CGTGTCTTCG ATGCCCTGCG AAAATCAAAA GTATTAACCG GATTCTGCTA TACACAGTTT ACTGATGTAG AACAGGAAAT AAACGGTCTT CTTACTTATG AGCGTGAACC AAAATGTGAT TTGGAAATTA TAAAAAATAT TGTAGAAAAG TAA
|
Protein sequence | MHRNEYPRPD FVREHWACLN GIWDFEFDDN NIGMSSKWYK KDHKLTEKIN VPFVFQSKLS NININDFHDF IWYKREFTID DSWKNKDILL HFGAVDYRCH VFINGELAGS HEGGHTSFYL NITNYLTWNK EEITVYVEDP SDDETIPRGK QYWLENPESI WYKRSSGIWQ SVWIEPVNKN YITDFRCTPL FDQGSVEFNI KTKSAKKNTK IMIQISFRDT LIAEDIITVN NIETKRIYDI FQKKIFRGCT HGSGWTWTPE NPNLFDVTLT LLTNDKVSDK IESYFGMRKI HTENGKVYLN NRPYYQRLVL DQGYWPDSLM TAPSDEDFKK DIILAKQMGF NGCRKHQKIE DRRFLYWADK LGYLVWSEMP STISYDSNSV SRITNEWIES VDRDYNHPCI VTWVALNESW GVSEINYNKM QQSHSLSLYY MLHSLDNTRP VIANDGWEAT KTDICAVHNY QHGTKDEKEK YEKFIKDLST KEDILDSVPA GRNIYADEFE YTGEPVMLTE FGGIGYDKIR PDGWGYTVAS SEAEFIHDLE RVFDALRKSK VLTGFCYTQF TDVEQEINGL LTYEREPKCD LEIIKNIVEK
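The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It strips the spaces that separate the 10-base blocks, translates codon by codon, and stops at the first stop codon. Translation table 11 uses the same codon-to-residue assignments as the standard genetic code, so a standard table is used here. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 30 bases of the gene are used as input to keep the example short.

```python
BASES = "TCAG"
# Residues for all 64 codons, enumerated in TCAG order for each position.
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCATAGAA ATGAATATCC CAGACCGGAT"
print(translate(prefix))  # MHRNEYPRPD
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field. The 30-base excerpt alone is not representative of the whole gene.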