Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_3467 |
Symbol | |
ID | 8598918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 3672087 |
End bp | 3673514 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | glycoside hydrolase family 1 |
Protein accession | YP_003310237 |
Protein GI | 269122060 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0716612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTATTGA AGAAGTCTTT TTTATGGGGT GGAAGTATAG CGGCCCATCA ATGCGAGGGA GCGTGGGATA AAGACGGCAA AGGTGTGGGA ATTATGGATC TGGTAACACA GGGTACTTAT GAAAAACCAA GAGAAATAAC TAAAACAATC GAGGAAGGTA AAATATATCC TTCTCATGAA GGAATTGATT TTTATCATAA TTATAAAGAA GATATTGCTT TATTTGCCGA AATGGGTTTT AAAGCACTGC GTATTTCTGT TGACTGGTCC CGTATATATC CAAAGGGTGA TGAAGAAAAA GCAAATGCAC TGGGAATTCA GTTTTATCAA AATATTGTGG ATGAATTATT GAAGCATAAA ATAGAGCCTG TAGTTACATT ATATCACTTT GAAATGCCTG TTCATTTAGT CACTGAATAT GGTTCATGGA CAAATCGAAA AGTAATAGAT TTTTATTTAA AATTCTGTAA AACAATGTAT GAAGCGTTAA AAGGAAAAGT TCACTACTGG TCTACATTTA ATGAAATGAA TCACATTGAT CCCCAGACAG AAGCATCAGA TATTTTCACA TATATTATTG CGGGTTTAAA GTATTCTGAA ATGGAGGATA AAAAGCAAAC ACTTGCGACA ATAGGATATA ACATGACTTT GGCAGGTGTA AAAGCTGTAA AACTGGGTCA TGATATAGAT GCCGGCAATA AAATAGGATG TGTATTTGGT CTGACACCGG TATATCCGTT TAATTGTAAT CCTGTAAATG TTATGAATGC TTTCAAAGAA AACGAGCATG AATTTTATCA GATTGATGCA ATGTGCAACG GAAAATTTCC CGGTTATAAA CTATGTGAAT ATCAGGAGCA GGGAATAAGT CTTGATATTA CAGATGAGGA TGAACAGGCC TTTGCAGAAG GAAAAATAGA TTTCATTGGA TTGAATTATT ATTCTTCAAG TGTATCTCGT TATGAAGGCA GTGAAAATGA TGAAGAGACT TTATTCGGCG GTATACAAAA TCCTTATCTG GAAAAAAGCA GATGGGGCTG GTCCATTGAT CCTGTCGGTT TGAGATATAT ACTAAATTAT GTTTATCGCA GATACGGTCT TCCTGTCATT ATTACCGAAA ATGGTCTGGG AGCTGTAGAT GAACCGGATA AAAACGGAAA TATCGAGGAT AATTACCGTA TTGAATATCT GCAAAAGCAT ATTGAGCAAA TAAAAAAAGC TGTTATTGAA GATCATGTCG AATGCTTCGG CTATCTGACA TGGGCACCGA TTGATCTGGT AAGTGCAACA ACCGGTGAAA TGAAGAAGCG GTATGGTTTT ATTTATGTAG ATAAACATGA TAACGGCAAA GGCACTTTGG AACGGAAAAA GAAAAAATCT TTTTACTGGT ATAAAAATGT CATAAAATCA AATGGCAGTG AATTATAA
|
Protein sequence | MVLKKSFLWG GSIAAHQCEG AWDKDGKGVG IMDLVTQGTY EKPREITKTI EEGKIYPSHE GIDFYHNYKE DIALFAEMGF KALRISVDWS RIYPKGDEEK ANALGIQFYQ NIVDELLKHK IEPVVTLYHF EMPVHLVTEY GSWTNRKVID FYLKFCKTMY EALKGKVHYW STFNEMNHID PQTEASDIFT YIIAGLKYSE MEDKKQTLAT IGYNMTLAGV KAVKLGHDID AGNKIGCVFG LTPVYPFNCN PVNVMNAFKE NEHEFYQIDA MCNGKFPGYK LCEYQEQGIS LDITDEDEQA FAEGKIDFIG LNYYSSSVSR YEGSENDEET LFGGIQNPYL EKSRWGWSID PVGLRYILNY VYRRYGLPVI ITENGLGAVD EPDKNGNIED NYRIEYLQKH IEQIKKAVIE DHVECFGYLT WAPIDLVSAT TGEMKKRYGF IYVDKHDNGK GTLERKKKKS FYWYKNVIKS NGSEL
|
| |