Gene Information |
Locus tag | Athe_0524 |
Symbol | |
ID | 7408648 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 591018 |
End bp | 592949 |
Gene Length | 1932 bp |
Protein Length | 643 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643714906 |
Product | aconitate hydratase |
Protein accession | YP_002572423 |
Protein GI | 222528541 |
COG category | [C] Energy production and conversion |
COG ID | [COG1048] Aconitase A |
TIGRFAM ID | [TIGR01342] aconitate hydratase, putative, Aquifex type; [TIGR01343] homoaconitate hydratase family protein |
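The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 591018
end_bp = 592949

# Assumption: coordinates are inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: one terminal stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1932
print(protein_length)  # 643
```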
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000206753 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTGA CAGTTGCGCA GAAGATTATA AAACAACACT TAGTTAAAGG TGAAATGATA CCAGGAAAAG AGATTGCAAT CAGAATTGAC CAAACACTTA CACAGGATTC AACTGGTACA ATGGCATATC TTCAGTTTGA AGCAATGGGT ATTGACAGGG TAAAAACTAA AAGGTCTGTT GCATACATTG ACCATAATAC ACTTCAGACA GGGCCAGAGA ATGCAGATGA TCATCTATAC ATACAAACTG TTGCAAAAAA ATATGGTATT TACTTTTCAA AACCCGGTAA TGGAATCTGC CACCAGGTAC ATCTTGAAAG GTTTGCGGTT CCAGGACAAA CACTTTTGGG GTCAGACAGC CACACACCAA CAGCTGGTGG AATAGGCATG CTTGCAATTG GTGCAGGTGG TTTAGATGTT GCAGTTGCAA TGGGTGGTGG CGAATATTAC TTAATTATGC CTAAGATTGT AAAAGTAAAC CTCAAAGGCA AGCTTCAACC TTGGGTTTCT GCAAAGGATA TTATTTTAGA GCTTTTGAGA AGGCTTACAG TAAAGGGTGG CGTTGGCAAA ATTTTTGAAT ACACAGGTGA GGGTGTAAAG ACTTTATCTA TACCAGAGAG AGCCACGATT ACAAATATGG GGGCAGAACT TGGAGCAACA ACTTCTATAT TCCCATCTGA TGAGGTGACA TACGAATTTT TGAAGGCACA GGGAAGAGAA GCTGACTTTG TTGAGATTCT GCCAGACCCA GATGCACAGT ATGATGAGGA GATTGAGATA GATTTATCGA GTCTGGTGCC GCTTGCAGCA TGCCCGCACA GCCCTGACAA TGTTGTGCCT GTGAGTGAGT TAAAAGGTAT AAAGGTTGAC CAGGTTGCAA TTGGAAGCTG CACAAACTCA TCTTACAAGG ACCTCATGAA GGTGGCAAAG ATTTTGGAAG GAAAAACCAT TGCAGAGCAT GTATCGCTTG TCATATCTCC AGGGTCAAAA CAGGTTTTGA ACATGCTTGC TCAAAACGGT GCACTGGCAT CAATGGTTGC AGCAGGTGCA AGGATTTTAG AGTGTGCTTG TGGTCCTTGT ATAGGCATGG GTCAAGCACC AAGAACAGGA GGCATTTCGC TCAGAACATT TAACAGGAAC TTTGAAGGAA GAAGCGGTAC ACCTTCTGCC AAAGTGTACC TTGTTTCTCC TGAGACTGCT GCAGCTTCTG CCATCACAGG TTATATCACA GACCCAAGAA CCCTTGGCGA TGAGCCAGAG GTTGAGATGC CAAAAAGTTT TCTTATAAAT GACAATTTAA TAGTGCCACC TGCTGAAAAT TCTGACGAGA TTGAGGTTAT AAGAGGACCG AATATAAAAC CATTTCCGCA AGGGAAGCCT TTGCCAGATG TTGTTGTGGG AAAAGTCTTG ATAAAGCTTG GAGATAATAT CACAACAGAC CACATTATGC CGTCTAACGC AAAGCTTTTG CCATATAGGT CGAACATTCC GTATTTATCT GATTACTGCT TGACACCATG CGACCCTAAT TTTCCTAAAA AAGCACGTGA AAATGGTGGA GGGTTTATAG TAGGTGGAAT AAACTATGGA CAGGGGTCAT CAAGAGAACA TGCAGCGCTT GTGCCGCTTT ATTTGGGTAT AAAAGGGGTT TTGGCGAAGA GCTTTGCACG AATACACATG GCAAATTTGA TAAACAACGG AATCATACCA ATGGTGTTTG AAAATCAGAA TGATTATGAT ACAATTGAAG AGATGGATGA ACTTAAAATT GAAAATGCAA GAGAACAGAT AGAAAAAAGT 
GATGTTTTAA TAATTGAAAA TGTCACGAAA GGATTAAAGT ACAGAATGAT TTTAAACCTG ACAGACAGAC AGCGTCAGAT GATTTTGCAT GGCGGGCTTT TGAACCTTAC AAAAGCTATG GGGATGAAAT AA
|
Protein sequence | MGLTVAQKII KQHLVKGEMI PGKEIAIRID QTLTQDSTGT MAYLQFEAMG IDRVKTKRSV AYIDHNTLQT GPENADDHLY IQTVAKKYGI YFSKPGNGIC HQVHLERFAV PGQTLLGSDS HTPTAGGIGM LAIGAGGLDV AVAMGGGEYY LIMPKIVKVN LKGKLQPWVS AKDIILELLR RLTVKGGVGK IFEYTGEGVK TLSIPERATI TNMGAELGAT TSIFPSDEVT YEFLKAQGRE ADFVEILPDP DAQYDEEIEI DLSSLVPLAA CPHSPDNVVP VSELKGIKVD QVAIGSCTNS SYKDLMKVAK ILEGKTIAEH VSLVISPGSK QVLNMLAQNG ALASMVAAGA RILECACGPC IGMGQAPRTG GISLRTFNRN FEGRSGTPSA KVYLVSPETA AASAITGYIT DPRTLGDEPE VEMPKSFLIN DNLIVPPAEN SDEIEVIRGP NIKPFPQGKP LPDVVVGKVL IKLGDNITTD HIMPSNAKLL PYRSNIPYLS DYCLTPCDPN FPKKARENGG GFIVGGINYG QGSSREHAAL VPLYLGIKGV LAKSFARIHM ANLINNGIIP MVFENQNDYD TIEEMDELKI ENAREQIEKS DVLIIENVTK GLKYRMILNL TDRQRQMILH GGLLNLTKAM GMK
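The relationship between the gene and protein sequences can be illustrated with a minimal translation sketch. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper name `translate` is hypothetical, and only the first 30 and last 9 nucleotides of the gene sequence are used as excerpts.

```python
# Build the codon table from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Drop the spaces that separate the 10-base groups in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 nucleotides of the gene sequence (first 10 residues of the protein).
print(translate("ATGGGTTTGA CAGTTGCGCA GAAGATTATA"))  # MGLTVAQKII

# Last 9 nucleotides of the gene sequence, including the TAA stop codon.
print(translate("ATGAAATAA"))  # MK
```

Passing the full gene sequence to the same function should reproduce the 643-residue protein sequence listed above, provided the sequence was copied without errors.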