Gene Information |
Locus tag | Athe_1598 |
Symbol | |
ID | 7409428 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1690566 |
End bp | 1692314 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643715967 |
Product | protein of unknown function DUF262 |
Protein accession | YP_002573465 |
Protein GI | 222529583 |
COG category | [S] Function unknown |
COG ID | [COG3472] Uncharacterized conserved protein |
TIGRFAM ID | |
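
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1690566
end_bp = 1692314
protein_length_aa = 582

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons. The last codon is assumed to be
# the stop codon, which does not contribute a residue to the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp)           # 1749
print(remainder)                # 0
print(expected_protein_length)  # 582
```

Under these assumptions the computed values agree with the Gene Length (1749 bp) and Protein Length (582 aa) fields.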
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTCC AGCAAGGACA GATGAACTTA TTGGACTTAG TTAGAAAGGC TTATAATGGT GAATGTATGC TTCCCGATTT TCAAAGAAAT TTTGTTTGGA CAAGATATGA CATTGAAGAG CTTATAAAAT CACTTCTTCA AGGTATGTTT ATAGGAACTT TTTTGATTTT AGAAACAAAT CCACAAAGTG TTCCTTTTAA GGTGATTTTT GTAGAAGGGG CAGAGAAAGT AAATCCCCAA ATATGTGAGC AACCCAAGAT ATTAATTTTA GATGGACAAC AAAGACTTAC ATCTTTATTT TACGCTATAT ATAGTCCTGA TATTCCTCTA CGCAATTCTG AGAATCCTTA TGCATTCTTT ATTGATTTAG AAAAACTTGC TGAAGATAAC ATTGAGGACG CGGTATTCAG CTGGTCAAAA AAATGGAGAG AGTTCAAGGA GATTATTGAT GAAAACGGAG ATTACAATCT TGAGGTTTTA AAGGCGAAGA AAGTATTGCC ATTGACAGTG TTTAAAGATA TCCCTGAGTT CTATAGATTA TGGTTTGGGG AGTATAAATT GTTATTTAAA GACCAGGAAG CAAATAAAAT ATTTGCTTAT ATTGATAACA TGATAAAATA TAACATTTTT ACTCTATCGC TTGGTCTTTC GTATAATGAC AAACCCGATG AAATTGCTGC TCTATTTGAA AAAATCAATA GAAGTGGTGT AAAACTTTCT ATTTATGATC TTCTTGTAGC AAGATTTTAC AAGTTCATAA GACTTCGTGA AAAGTGGGAA GAAGTATTTG AAAATAGTGT TAATATTAAA AAACTTGCAG GAAGAATAGA TAACACCACA GTTCCCTATT CATTTATTCA AGCTCTTGCT TTAGCGGCGG ACAAAAATAT CAGTTCACGA GAAATGCTAA AGATAGATAA TAACATTCTT TCAGACCAGA GCTGGGCCAA AGTGGTTGAT ATTGCAGAAA ACAAGGTATT GCCTTATTTG CTTCAGATAA ATAACTTTGG CATTGTGGAT TTTGAAAAGT GGCTACCTTA CTATCCCATT GTCACGATGA TGATTGCACT CTTTTTGAAA TTTGAGCATC CTGACACAGA TAAAATTGAA AAATGGTACT GGAGTGCAGT TTTTTCTGAA CGATATTCAG GTTCTACTGA AACTGCTATG GCAAAAGACT TCAAAGAAGT ATGTGTCTGG TTTAATAATA ATAACTTTTT ACCTGAGGTT GTGGAAAAAT TAAGAAATCA ATTAGAGAGT AATGTATATA CCTTGAAAGA GGTAAGGAGA AAGGGAAGTT CAAAATATAT CGGAATTTTC AATCTTTTAT TTAAAAACGG AGCAAAGGAT TTTTATTATC CTGAGAACAT TGCTTTTAAC CAGCTTGATG ACCATCATAT TTTTCCAGTG AGCTTTTTGA AAGTCAAAGG TGTGGAGGTT GATGTTGACT CAATTATGAA CAGGACATTG ATTTTTGAAA ATACTAACAG AAGCATATCT CGTCGCAGTC CCGGTGATTA CATAAGAAAG ATGATTGAAA TTCAAAAATC AAAAGGGCTC TCAGAGCAAG AAGCAGAACA CAAGGTAAAA GAGATATTAA GGGGCCATTT CATTGATGAA GAAATGTATA TATTATTGAA AAACACTACT GATAATCTGA CACCTTCTGA GATTAAAGAG AATTTTGAAA GATTTATAAG TAAACGAGAA AAGTTAATTT TGAATGAGAT AAAAAGGCTG ATATGGTAA
Protein sequence | MNLQQGQMNL LDLVRKAYNG ECMLPDFQRN FVWTRYDIEE LIKSLLQGMF IGTFLILETN PQSVPFKVIF VEGAEKVNPQ ICEQPKILIL DGQQRLTSLF YAIYSPDIPL RNSENPYAFF IDLEKLAEDN IEDAVFSWSK KWREFKEIID ENGDYNLEVL KAKKVLPLTV FKDIPEFYRL WFGEYKLLFK DQEANKIFAY IDNMIKYNIF TLSLGLSYND KPDEIAALFE KINRSGVKLS IYDLLVARFY KFIRLREKWE EVFENSVNIK KLAGRIDNTT VPYSFIQALA LAADKNISSR EMLKIDNNIL SDQSWAKVVD IAENKVLPYL LQINNFGIVD FEKWLPYYPI VTMMIALFLK FEHPDTDKIE KWYWSAVFSE RYSGSTETAM AKDFKEVCVW FNNNNFLPEV VEKLRNQLES NVYTLKEVRR KGSSKYIGIF NLLFKNGAKD FYYPENIAFN QLDDHHIFPV SFLKVKGVEV DVDSIMNRTL IFENTNRSIS RRSPGDYIRK MIEIQKSKGL SEQEAEHKVK EILRGHFIDE EMYILLKNTT DNLTPSEIKE NFERFISKRE KLILNEIKRL IW
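
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is not the database's own method: the `translate` helper and the `head`/`tail` variables are hypothetical names introduced here. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not handle). Although the record lists the gene on the minus strand, the gene sequence as given begins with ATG, so it is treated here as already being in coding orientation.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
# Table 11 uses the same codon-to-amino-acid assignments as the
# standard code; alternative start codons are NOT handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
head = "ATGAATCTCC AGCAAGGACA GATGAACTTA"
tail = "ATATGGTAA"

print(translate(head))  # MNLQQGQMNL
print(translate(tail))  # IW
```

The two fragments reproduce the first ten residues (MNLQQGQMNL) and the last two residues (IW) of the protein sequence listed above. Passing the full gene sequence to `translate` would be the natural next check.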