Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_3661 |
Symbol | |
ID | 8599107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 3889235 |
End bp | 3890575 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003310426 |
Protein GI | 269122249 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA AAAATGCCTT TTCAATTTCT ATTGCAGGCG GCGGCAGTAC CTATACACCT GAAATAATTC TAATGCTTTT AGAAAACCTG GATCGTTTTC CTATTAGAAA AATTATTTTT TATGATAATG ATTATGAACG TCAAAAACTG GTAGCAGATG CATGTGAAAT AATACTCCGT GAAAGAGCTC CCGAAATTGA ATTTTTATCA ACAACAAACC CAAAAGACGG GTTTACAGAT ATAGATTTTG TAATGGCACA TATTAGAGTA GGAAAGTATG AAATGAGGGA AAAGGATGAA AAAATTCCTT TGGAGCACGG AATAGTCGGG CAGGAAACCT GCGGACCCGG CGGTATAGCC TATGGACTCA GATCAATTAC CGGAGTGGTG GAGCTTATTG ACTATATGGA AGAATACTCA CCGAATGCAT GGATGCTGAA TTACTCCAAT CCTGCTTCAA TAGTGGCAGA AGGTGTCAGA GTTCTGAGAC CTGATTCAAA AGTTTTGAAT ATCTGTGATA TGCCTATAGA TATAGAAGAA CGCATGGCTG TGATTGCAGG GCTGGAATCA AGAAAAGATA TGACAGTGCG TTATTACGGA CTAAATCACT TCGGATGGTG GGATGATATA AGAGATAAAG ACGGAAATGA CCTTATGCCT GTGATAAAAA AGTATGTTCT GGAAAATGGA TATAATTTAG AGCTGTTAAA TAAAGAGGTA AGGCTTAATG ATCCTGACTG GCTTGATACA TACCGTTTTG CAAAGGAAGT TTATCGTCTG GATACTGAAA CTCTTCCGAC TACTTATTTT AAATATTATC TGTTTTCTGA TTATGCTGTA AAGCATACTG ATAAAAATTA TACACGGGCA AATCAGGTAA TGGACGGACG GGAAAAAAGA GTTTTTGATG AATGCAGTAC CATCGTGGAA AAGAATACCG CAAAGGAAAC AAATTTGGAA ATAGGTGTTC ATGCCAGCTA TATTGTGGAT TTAGCAAGAG CACTTGCATT TAATACACAT GAAAGAATGC TTCTTATTGT AGAAAATAAA GGAGCTATAT GCGGCGTGGA TCCTACAGCT ATGGTTGAAA TCCCGTGCAT TGTCGGTGCC AACGGTCCGG AGCCTTTATC TATAGGTGAA ATCCCGACTT TCCAAAGAGG ACTTATTCAG CAGCAGCTGG CAGTGGAAAA GCTGGCAGTG GAAGCCTATG TAGAAGAATC GTATCAAAAA TTATGGCAGG CCTTGACGTT ATCCCGTACA ATGGGTGATG CTGATCTGGC TAAGGTTATA TTAGATGAAC TGATTGAAGC TAATATAGAT TACTGGCCGA ATTTAAAATA A
|
Protein sequence | MNKKNAFSIS IAGGGSTYTP EIILMLLENL DRFPIRKIIF YDNDYERQKL VADACEIILR ERAPEIEFLS TTNPKDGFTD IDFVMAHIRV GKYEMREKDE KIPLEHGIVG QETCGPGGIA YGLRSITGVV ELIDYMEEYS PNAWMLNYSN PASIVAEGVR VLRPDSKVLN ICDMPIDIEE RMAVIAGLES RKDMTVRYYG LNHFGWWDDI RDKDGNDLMP VIKKYVLENG YNLELLNKEV RLNDPDWLDT YRFAKEVYRL DTETLPTTYF KYYLFSDYAV KHTDKNYTRA NQVMDGREKR VFDECSTIVE KNTAKETNLE IGVHASYIVD LARALAFNTH ERMLLIVENK GAICGVDPTA MVEIPCIVGA NGPEPLSIGE IPTFQRGLIQ QQLAVEKLAV EAYVEESYQK LWQALTLSRT MGDADLAKVI LDELIEANID YWPNLK
|
| |