Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_2547 |
Symbol | |
ID | 8598008 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 2708462 |
End bp | 2709901 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | glycoside hydrolase family 1 |
Protein accession | YP_003309328 |
Protein GI | 269121151 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0806308 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAGAAA AGCAGTCTGT CTTACCTGAA AATTTTTTAT GGGGAGGTGC TGTTGCGGCT AATCAGTGTG AAGGTGCATA TCTTGAAGAC GGAAAGGGAT TGTCTCCTGT GGATATTCTT CCTGATGCAG CACACGGCAG ATGGGAAGCA CTGGAACATC CGATGAAAAC ATTGGAAACA AAATATGATT TTTATCCCAG TCATGAATCT ATTGATTTTT ATCACAGATA CAAAGAGGAT ATAAAATTAT TTGCAGAAAT GGGATTTAAA TGCTTTAGAA CGTCAATCTG CTGGCCGAGA ATATTTCCAA ACGGAGATGA TATGATACCA AATGAAAAAG GACTGGAATT CTATGATAAT TTATTTGATG AATGTAAAAA ATACGGAATA GAGCCTTTAG TAACAATAAA TCATTTTGAT ACACCTGTAG AACTGATGAA AAAATATAAC GGCTGGAAGG ACAGAAGATG TATTGATTTT TATCTGAGAT TCTGTGACGT TATATTCAAA AGATATAAGG GAAAAGTGAA GTACTGGATG ACATTTAACG AAATAAACAT GATACTTCAT ATACCATTTT TTGGCGGAGG AATAGATGTA ACAAATGAGG AAAATCCTCT TCAGACAAAG TATCAGGCGG CACATCATCA GCTGGTAGCA TCTGCTATGG CAACTAAGCT GGGACATGAA ATAGATCCTG AAAATAAAAT AGGATGTATG CTTGCAGCAG GACAGACATA TCCATATACA TGCAATCCTG AGGATATCTG GAAATCAATA GAAAAAAACA GAGAGGGTTA TTTCTTTATA GACATACAGT CAAGAGGATA TTATCCCAGT TACAGTAAAA GATTTTTTGA AGAAAATAAT ATAAAGCTGG AATTGCATGA AGGGGATGAA AAGGTATTAA GGGAAAATAC AGTGGATTAT ATATCATTTA GTTATTATTC TTCGCGTCTG ACAAGTGCTG ATCCGCAAAA TAACAGCAAA ACTGAGGGTA ATGTATTTTC AACAATGAAA AATCCATATC TTCAGGCTAG TGAGTGGGGA TGGCAGATTG ACCCGCTGGG ATTAAGAATT ACAATGAATG AGATATATGA CAGATATCAA AAACCTTTAT TCATAGTGGA AAATGGTCTG GGAGCCGTGG ATACCGTGGA ACCAGACGGA AGCATAAATG ATGATTACCG TATAGAATAT TTAAGAGAGC ATGTACGTGA AATGAAGGAG GCAGTCCTTG ACGGAGTGGA ACTGATGGGA TATACTCCAT GGGGCTGTAT AGATCTTATA AGTGCCGGAA CCGGTGAAAT GAAAAAACGT TATGGTTTTA TATATGTTGA CAGAGATAAT GACGGTAACG GAACACTGGA ACGAAGCAGA AAAAAATCAT TTGAATGGTA TAAACAGGTT ATAGCAAGTA ATGGGGAAAA TATAAAATAA
|
Protein sequence | MSEKQSVLPE NFLWGGAVAA NQCEGAYLED GKGLSPVDIL PDAAHGRWEA LEHPMKTLET KYDFYPSHES IDFYHRYKED IKLFAEMGFK CFRTSICWPR IFPNGDDMIP NEKGLEFYDN LFDECKKYGI EPLVTINHFD TPVELMKKYN GWKDRRCIDF YLRFCDVIFK RYKGKVKYWM TFNEINMILH IPFFGGGIDV TNEENPLQTK YQAAHHQLVA SAMATKLGHE IDPENKIGCM LAAGQTYPYT CNPEDIWKSI EKNREGYFFI DIQSRGYYPS YSKRFFEENN IKLELHEGDE KVLRENTVDY ISFSYYSSRL TSADPQNNSK TEGNVFSTMK NPYLQASEWG WQIDPLGLRI TMNEIYDRYQ KPLFIVENGL GAVDTVEPDG SINDDYRIEY LREHVREMKE AVLDGVELMG YTPWGCIDLI SAGTGEMKKR YGFIYVDRDN DGNGTLERSR KKSFEWYKQV IASNGENIK
|
| |