Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2123 |
Symbol | |
ID | 7408832 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2256842 |
End bp | 2257642 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643716488 |
Product | Cof-like hydrolase |
Protein accession | YP_002573971 |
Protein GI | 222530089 |
COG category | [R] General function prediction only |
COG ID | [COG0561] Predicted hydrolases of the HAD superfamily |
TIGRFAM ID | [TIGR00099] Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily [TIGR01484] HAD-superfamily hydrolase, subfamily IIB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000000204309 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAC TTGTAGCAAC AGACCTTGAT GATACGTTGC TGTCAAAAGA CTTAACAATA ACTGAGAAAA ATTTAAACGC TATAGAGTTT TTAAAGAAAA ACAATATTAT TTTTATATTA GCTTCAGGAA GACCATACCC TTCTATTAAA AATGTGGCAT ATGACCTTCA AAATTTTTAC CCAATGATAA CATACCAGGG TGCACTTGTT TATGACCCAA AAAATGACAA AAAGCTTTAT GGCTGCGAAA TTAAGCCAGA GGACGCAAAA GAACTTGTAA GGCTTGCAAA AGATGAAGGA ATTCATGTTC ATATTTACAT TGACAATGTA TGGTATGTTG AAGCTATGAA CGAAAAGACT GAATACTATA GAAATCTTAC AAAGCTTGAA CCCCACATAG TTAAAAATTT ACTTGAATTT ATCGACAGAC CTGTCACAAA GGTTTTGTTT TTTGATGAAC ATGAAAGATT AAAAGATTTG AAAGAAAGTC TTCCAGATGA TTTTTCGAAG AAATTCAACA TAATGTTTTC AAAACCCTTC TTCTTGGAAT TTACAGATAT CAATGTTTCA AAGGGGAATG CTCTTAAGTT TTTAACCGAG TACTACGGTT TGAAAAGAGA AGAGGTTATG GCAATTGGTG ATGGTGACAA TGATATTTCA ATGATTGAAT ATGCAGGGAT AGGTGTTGCT GTTGAAAATG CAGTTGAAAA GCTAAAAGAA GCTGCTGACT TTGTTGTTGC AAAGAGTGAT GATAGCGGTT TTGCACAGGC TATAGAAAAA GTATTCAACG TTCATTTTTA A
|
Protein sequence | MIKLVATDLD DTLLSKDLTI TEKNLNAIEF LKKNNIIFIL ASGRPYPSIK NVAYDLQNFY PMITYQGALV YDPKNDKKLY GCEIKPEDAK ELVRLAKDEG IHVHIYIDNV WYVEAMNEKT EYYRNLTKLE PHIVKNLLEF IDRPVTKVLF FDEHERLKDL KESLPDDFSK KFNIMFSKPF FLEFTDINVS KGNALKFLTE YYGLKREEVM AIGDGDNDIS MIEYAGIGVA VENAVEKLKE AADFVVAKSD DSGFAQAIEK VFNVHF
|
| |