Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_3871 |
Symbol | |
ID | 8599317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | + |
Start bp | 4116270 |
End bp | 4117580 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | glycoside hydrolase family 4 |
Protein accession | YP_003310636 |
Protein GI | 269122459 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGAC TAAAAATTAC TACTATCGGC GGAGGTTCCA GTTATACTCC TGAATTAATA GAAGGATTTA TCAGAAGATA TGACGAACTG CCGGTTACTG ATTACTATCT GCTTGATATC GAGGAAGGGA AAGAAAAGCT TGAGATTGTT GGCGAATTTG CCAGAAGAAT GGTCAAAAAA GCCGGAGTTC CCATAAATAT TCATCTTACT CTGGACAGAG AAGAAGCCCT GAAAGATGCG GATTTTGTGA CAACTCAGTT AAGAGTGGGA CTTCTTGATG CCAGAATCAA TGATGAAAAA ATCCCTTTAA AATATAATGT GCTTGGTCAG GAAACTACAG GACCCGGCGG ATTTATGAAA GCCCAGAGAA CTATTCCTGT TCTTTTAGAT ATATGCGAAG ATATGAAGAG ACTTTGTCCC GATGCATGGC TTATTAATTT TACCAATCCT GCAGGTATAG TAACAGAAGC AATAAAAAAA TACAGCAGTA TAAAAACTAT AGGTATTTGC AGCGGTGCAA ACAGCATGCT TATGGATATT GCAAAAGCTT ATGATGTACA AAAAGACGAC ATCTACACCA GAATAATCGG GCTGAATCAT CTGATTTTTG CAGATAAAAT ATTTCTAAAA GGCGAAGATA TTACTGATGA TTTTATAAAA AAATTATCTG CCGGAAAAGC GGATAACAGT TTGAAAAATA TTCCGGATAT AGGCTTTTCC GCTAAATTCA CAGAAGCACT GCATATGTAT CCTATTTCAT ATCTGAAATA TTTTTTCCTG AACAGAGAAA TGGTTGAAAT TGCAAAAAAA GACGAAGCTG AAAAAGGTAC AAGAGGTGAA CAGACAAAGG CAATAGAACA TAATCTATTT GAGCTTTATA AAGATAAAAA TCTGGACACC AAACCTAAAG AATTGGAAAA ACGCGGAGGA GCATATTACT CTGAAACAGC ATGCTCTATA ATAAGCTCTA TTTATAATAA TAAAAAAGAA ATACATGTGG TAAACACTCT TAATAACGGT ACCACTTCTG ATCTTCCCGA TAATGTGGTA ATTGAAACAA ATGCGGTAAT AGATAAGGAT GGTGCACATC CTGTCACATA TGGAAAACTG CCTGTAAAAA TAAGGGGACT TATCCAGAGT GTGAAAGCAT ACGAAGAGCT TACAGTGGAA GCTGCTGTAA CAGGCAGCTA TGATACGGCT CTCCTTGCCC TGAGCATTAA TCCTCTTGTT CCGTCTGCAA ATGTAGCAGA AAGTATACTA AATGAGCTTC TTGAGGTTAA TAAAAAATAC CTGCCTCAAT ATTTTAAATA A
|
Protein sequence | MKGLKITTIG GGSSYTPELI EGFIRRYDEL PVTDYYLLDI EEGKEKLEIV GEFARRMVKK AGVPINIHLT LDREEALKDA DFVTTQLRVG LLDARINDEK IPLKYNVLGQ ETTGPGGFMK AQRTIPVLLD ICEDMKRLCP DAWLINFTNP AGIVTEAIKK YSSIKTIGIC SGANSMLMDI AKAYDVQKDD IYTRIIGLNH LIFADKIFLK GEDITDDFIK KLSAGKADNS LKNIPDIGFS AKFTEALHMY PISYLKYFFL NREMVEIAKK DEAEKGTRGE QTKAIEHNLF ELYKDKNLDT KPKELEKRGG AYYSETACSI ISSIYNNKKE IHVVNTLNNG TTSDLPDNVV IETNAVIDKD GAHPVTYGKL PVKIRGLIQS VKAYEELTVE AAVTGSYDTA LLALSINPLV PSANVAESIL NELLEVNKKY LPQYFK
|
| |