Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0359 |
Symbol | |
ID | 7409289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 411945 |
End bp | 413345 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643714745 |
Product | hypothetical protein |
Protein accession | YP_002572268 |
Protein GI | 222528386 |
COG category | [S] Function unknown |
COG ID | [COG3885] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00296] uncharacterized protein, PH0010 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000914729 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAGGAT ATTTGCTTCC ACATCCACCA GTGCTGATTC CCGAGATTGG GAGAGGCGAG GAGAAAAAGT GCCAGGCGAC CTTAGATGCT TTACAAAAGG TAGCAGATGA GATAGCTGAA TATAAACCTG AGGTCATAGC CATAATATCA CCCCATGCAC CTGTGTTTAC GGATGCTTTT TTCTTGAACG ACAAGCCAGA AATTGGTGGA AGCCTTGCAA GATGGGGTGT ATATGGAATT GAATTTAGGT TCAAAAATAA CCTTGAGATA GTTCAAGACA TAGCAAAAAT GTGCAGCCAG GAAGGGTTGA CGGTTGGATT TGTGTCAGAC AAAATTCAAA AAAGATATGG CGTTTCGCGA GAGCTTGACC ATGGCGCGTT AGTTCCGCTT TATTTTATTA CCAGAAAGTA TAAAGAATTT GAGCTTATAC ACACTTCTTA CTGTATGCTT GATGATATTA AGCTTTATAA ATATGGAATG ATACTCAGAA GGGCAATTGA AAAGCATGGC AAAAAAGGTT TAATTATAGC TTCAGGCGAC CTTTCGCACA AACTCTCTTA CGATGGGCCT TACGGGTTTG CAAAAGAAGG ACCTGAGTTT GACAAACTTC TGGTTGAACT TTTGCAAAGT AGCAATGTAC GAGCACTTTA TGACATAGAT CCTGTACTTT CAGAGAAGAC GGCAGAATGT GGTTTCAGAT CCATAAAGGT TTTGCTTGGA GCATTTGAGG GCTATAGTAT AGAATCAAAG GTTTATTCAT ATGAAGGACC TTTTGGCGTT GGATACTGTG TTGCTGCCTT TTACCAGAAA GAACAGACAA GCTCTTCTTT GTTTGAGGAG ATAGTGAAAA AAAGAGAAGA GAGACTAAAG AGAATAAGAG AAAATGAAGA TGAATATATA AGACTTGCAA GAGAAAGCTT AGAATACTAT GTAAGACACC GCAGGTACTT AGATTATATA CCAGATTATG TCACAGAACG GATGCTAAGA GAAAGAGCAG GAGTTTTTGT GTCAATTAAA AAGGATGGAA ACTTGAGAGG ATGTATAGGT ACAATTTATC CTACTCAAGA AAACATTGCA AAAGAGATAA TCAGAAACGC TGTTGCAGCA GGGTTTCACG ACCCCAGGTT TGAAGAGGTA ACAGAAGATG AGCTTGACAG TCTTGTGTAT GATGTTGATA TTCTAAGCCC ACCTGAGAAG GTAAACTCGA AAGACCAACT TGATCCTAAA AAATATGGAG TTATTGTGCG AAAAGGTGCA AGACAAGGGC TTTTGCTTCC TGATTTAGAA GGTGTTGACA CAGTTGAAGA GCAGCTTAAG ATAGCCTGCA GAAAAGCAGG AATTGATTAT GAAAGTGAAG ATTTTGAGAT AGAAAGGTTT ACAGTTGAAA GACACAAGTA G
|
Protein sequence | MVGYLLPHPP VLIPEIGRGE EKKCQATLDA LQKVADEIAE YKPEVIAIIS PHAPVFTDAF FLNDKPEIGG SLARWGVYGI EFRFKNNLEI VQDIAKMCSQ EGLTVGFVSD KIQKRYGVSR ELDHGALVPL YFITRKYKEF ELIHTSYCML DDIKLYKYGM ILRRAIEKHG KKGLIIASGD LSHKLSYDGP YGFAKEGPEF DKLLVELLQS SNVRALYDID PVLSEKTAEC GFRSIKVLLG AFEGYSIESK VYSYEGPFGV GYCVAAFYQK EQTSSSLFEE IVKKREERLK RIRENEDEYI RLARESLEYY VRHRRYLDYI PDYVTERMLR ERAGVFVSIK KDGNLRGCIG TIYPTQENIA KEIIRNAVAA GFHDPRFEEV TEDELDSLVY DVDILSPPEK VNSKDQLDPK KYGVIVRKGA RQGLLLPDLE GVDTVEEQLK IACRKAGIDY ESEDFEIERF TVERHK
|
| |