Gene Information |
Locus tag | Athe_0679 |
Symbol | |
ID | 7407103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 764058 |
End bp | 765008 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643715055 |
Product | glycosidase PH1107-related |
Protein accession | YP_002572571 |
Protein GI | 222528689 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2152] Predicted glycosylase |
TIGRFAM ID | |
|
|
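The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length includes one terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 764058
END_BP = 765008
GENE_LENGTH_BP = 951
PROTEIN_LENGTH_AA = 316


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons agree with the reported Gene Length and Protein Length.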
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000013124 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGG TCTTTGCAAA AAAGGACATA TTTACACGCT ACAGCGGAAA TCCAATAATC ACAGTATTTG ATATTCCATA TTCAGCAAAT GCTGTGTTTA ACGCTGGTGC AATAAAATAT AAAAATGAGT ACTTACTACT TTTGAGAGTT GAGGATAGAC AAGGAAAATC GCATTTGACT GTTGCTCGAA GTAGTGATGG AAAGACTAAT TGGAAGATTG AAAAGTCACC TCTGATTTAT CCCCAACCCA CAGTGTTCAT ATACGAAGAG TTTGGCTGCG AAGACCCGCG AATAACCTAT ATTCCTGAAG ATGATTACTA CTATATAACC TATACTGCAT ACTCTCGTTA TGGTCCAGCA GTTGCACTTG CAAGAACAAA GAACTTTAAA AAAGTTGAAA AAATGGGATT AATTTATCCT CCCAACAACA AAGATGCAGT TTTGTTCCCT GAAAAAATTA ACGGTAGGTA CGCCATGCTT CACAGACCTG TTGCAGGGGA CATCGAACAC ATTTGGATTG CATATTCGTC AGATCTTATC CACTGGGGAA ACCATGAGGT TGTGCTTGTT GAAAAAGGAG GCCCGTGGTG GGACGGCTTT AAAGTCGGCG CAGGAGCTGT GCCAATAAAA ACTCAAGAGG GCTGGCTAAT AATCTATCAC GGCGTGAAGA TGATGCCATC AGGACCAATT TACAGGCTTG GTGCTGCGCT TTTGGACTTA GAAAATCCGG CAAAGGTCAA GAAAAGATGT CCAGAGTGGC TATTATCACC TCAGGAAGTA TATGAACGAA TTGGAGATGT CAACAATGTG GTATTCACCT GTGGAGCAAT TGTTGAAGAC AACCAAATCT ATCTTTACTA TGGAGCTGCA GACTCTTGCA TTGCTCTTGC TTTTGCTGAG ATTGACCAGA TTTTGTCCAT CTTGATTGAG GGGGATTTGG CAACAAAATA G
|
Protein sequence | MKKVFAKKDI FTRYSGNPII TVFDIPYSAN AVFNAGAIKY KNEYLLLLRV EDRQGKSHLT VARSSDGKTN WKIEKSPLIY PQPTVFIYEE FGCEDPRITY IPEDDYYYIT YTAYSRYGPA VALARTKNFK KVEKMGLIYP PNNKDAVLFP EKINGRYAML HRPVAGDIEH IWIAYSSDLI HWGNHEVVLV EKGGPWWDGF KVGAGAVPIK TQEGWLIIYH GVKMMPSGPI YRLGAALLDL ENPAKVKKRC PEWLLSPQEV YERIGDVNNV VFTCGAIVED NQIYLYYGAA DSCIALAFAE IDQILSILIE GDLATK