Gene Information |
Locus tag | Athe_2474 |
Symbol | |
ID | 7409343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2612945 |
End bp | 2613874 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643716837 |
Product | hypothetical protein |
Protein accession | YP_002574315 |
Protein GI | 222530433 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
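
The start and end coordinates, gene length, and protein length listed above agree with one another. A minimal sketch of that check follows, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2612945
end_bp = 2613874

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the sketch gives 930 bp and 309 aa, matching the Gene Length and Protein Length rows.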
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000000139417 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGATA GTTTGCCACC ACAGGAGCAT GATTCAACTT TTAAGTTTTT GTTTGAAAAT GCAAAAGATA TCCTTTTCCT TGTCAGGGAC GTAATAGGCT ACAGCTGGGC AAAAGATATT CAAGAAGACT CAATAGAACT TGCGAACAAA GAATTTGTAG ATGAAGACTT TCTGCAAAGA AGAGCAGACG TCATAGCAAA AGCAAAACTA AAAGACAGGG AAGTATACTT CTACATTATC ATCGAAAATC AATCGAGAGT CGATGGGAAT ATGCCAAAAA GACTTTTGGA GTACATGATT TTGCTATGGG CAAAGAAAAT CAGAGAAGGT GTAAAGAAAC TTCCGGCGAT AATTCCAATA GTAACATACA ACGGTCTTGA TAAGGACTGG GATATACCAC AGGAAATAAT CAGCGAATTT GATATTTTCA AAGACGATAT TTTCAGGTAC GCTCTTGTAA ACATTTCAAA ATTAGATGCA AAGGCTCTGT TGCAAGAGGA AGAGGATGTC TTGAGCCCGG TAGTGTTCTA CTTAGAACAA GTGCGAGATG ATACAGAAAA GTTAATTGAG AGGCTAAAAG AGCTTGTACC AAAACTGCAA AACTTCAGTC AAACCAATAT GGAGAGGTTT TTAACATGGG CGGGAAATGT AATACGTCCG AGGTTTCCAA AAGAGGAAAG GGAGAAGTAT GATAAGCTTG ACCAGGAGCT AAAGCAGGGG GGAGTGGCGA AAATGGGTGA GTTTGTATCT AATGTTGCAA AACTACTGGA TGAAGCACAG ATGAAAAAGT ACAACGAAGG CGTTATTAAA ACAAGAATAG AAATAGCAAG GAACATGATA AAAGAAGGGG CAGAGGACAT CTTTATAGCA AAGGTGACAG GACTTACAAT TGAAGAAGTG AGAAAACTCA GAGACGAAAC TCTATCATAA
Protein sequence | MRDSLPPQEH DSTFKFLFEN AKDILFLVRD VIGYSWAKDI QEDSIELANK EFVDEDFLQR RADVIAKAKL KDREVYFYII IENQSRVDGN MPKRLLEYMI LLWAKKIREG VKKLPAIIPI VTYNGLDKDW DIPQEIISEF DIFKDDIFRY ALVNISKLDA KALLQEEEDV LSPVVFYLEQ VRDDTEKLIE RLKELVPKLQ NFSQTNMERF LTWAGNVIRP RFPKEEREKY DKLDQELKQG GVAKMGEFVS NVAKLLDEAQ MKKYNEGVIK TRIEIARNMI KEGAEDIFIA KVTGLTIEEV RKLRDETLS
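
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one possible way to do this, not the database's own pipeline. It assumes the gene sequence is printed in coding orientation (it starts with ATG and ends with TAA) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as starts. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGCGTGATA GTTTGCCACC ACAGGAGCAT"
print(translate(prefix))
```

For the first three blocks this yields MRDSLPPQEH, the first block of the protein sequence. Passing the full gene sequence string to `translate` would be the way to check the remaining residues.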