Gene Information |
Locus tag | Athe_0234 |
Symbol | |
ID | 7407225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 283862 |
End bp | 284848 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643714634 |
Product | glycoside hydrolase family 5 |
Protein accession | YP_002572157 |
Protein GI | 222528275 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAT TACCAAAATA CAAAGGATTC AATCTACTTG GTCTTTTTGT TCCAAATATG AGCTATGGAT TTTTTGAAGA TGATTTTAAA TGGATGCAGG AGTGGGGATT TAACTTTGCA AGAATACCTA TGAACTACAA GAACTGGTAT GTTGAAGGGC GACCGGAAAT AAAAGAGGAA GTTTTAGAAA TGGTAGACAA GATTGTTGTC TGGGGGCAAA AGTATGGTAT TCATATCTGT TTGAACATTC ACGGTGCGCC AGGCTATTGC GTGAATGAAA GAACAAAAGA AGGCTATAAC CTGTGGAAAG ACAGAGAACC TCTTGAGTTA TTTGTATTAT ATTGGCAGAC ATTTGCTAAA AGGTATAAAG GGATATCTTC AAAACATTTG AGTTTTAACC TTATAAATGA ACCAAGACAG TATTCAAAAG AGGAGATGAC AAAAGAGGAT TTTATAAGAG TTATGACATA TACAATTGAA AAGATAAAAG AGATTGATAA AGAGAGGCTT ATTATAATTG ACGGTGTTGA TTATGGCAAT GAACCTGTTT TTGAGCTCAC AAATTTAGAT GTAGCACAAG CTTGCAGGGC ATACATTCCG TTTGAGCTTA CCCATTACAA GGCAGAGTGG GTTGAGGAGA GTAACAAATT TGCTGAGCCA TCGTGGCCGC TTGTACGCGA AAATGGTGAG GTTGTTGACA AGGATTATTT AAGAAGGCAT TATGAAAAGT GGGCAAAATT GTTTGAATAC GATGTTGGTG TAATTTGTGG TGAAGGTGGA GCATACAATA AAACACCCCA TCATATTGTT GTGAGATGGC TGGCTGATGT ATTGGATGTT TTAAAAGAGC TTGGGATTGG GATTGCGCTT TGGAATTTGC GTGGTCCTTT TGGAATTATT GATTCTGGAA GAGATGATGT CGAATATGAA GACTTTTATG GGCACAAGCT TGATAAAAAG TTACTCGAAC TTTTGATGAG ATTTTGA
|
Protein sequence | MNKLPKYKGF NLLGLFVPNM SYGFFEDDFK WMQEWGFNFA RIPMNYKNWY VEGRPEIKEE VLEMVDKIVV WGQKYGIHIC LNIHGAPGYC VNERTKEGYN LWKDREPLEL FVLYWQTFAK RYKGISSKHL SFNLINEPRQ YSKEEMTKED FIRVMTYTIE KIKEIDKERL IIIDGVDYGN EPVFELTNLD VAQACRAYIP FELTHYKAEW VEESNKFAEP SWPLVRENGE VVDKDYLRRH YEKWAKLFEY DVGVICGEGG AYNKTPHHIV VRWLADVLDV LKELGIGIAL WNLRGPFGII DSGRDDVEYE DFYGHKLDKK LLELLMRF
|
| |
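The length fields above can be cross-checked against the sequences themselves. The following is a minimal sketch, not part of the original record: the helper names (`clean`, `gc_percent`, `expected_protein_length`, `translate`) are hypothetical, and it assumes the sequence is pasted in the 10-character blocks shown above. It also assumes the standard codon assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
# Hypothetical helpers for sanity-checking the record; names are illustrative.

BASES = "TCAG"
# Amino acids in NCBI codon order (first base T,C,A,G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(1 for base in s if base in "GC") / len(s)


def expected_protein_length(cds_length_bp: int) -> int:
    """Protein length implied by a CDS length, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first three blocks of the gene sequence give the first ten residues
# of the protein sequence listed in the record.
first_blocks = "ATGAATAAAT TACCAAAATA CAAAGGATTC"
print(translate(first_blocks))        # MNKLPKYKGF
print(expected_protein_length(987))   # 328
```

Under these assumptions, a 987 bp gene gives 329 codons, one of which is the stop codon, which agrees with the listed protein length of 328 aa. Passing the full gene sequence to `gc_percent` gives a value to compare with the listed GC content.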