Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_2183 |
Symbol | |
ID | 7408376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 2312099 |
End bp | 2313454 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716548 |
Product | protein of unknown function UPF0027 |
Protein accession | YP_002574031 |
Protein GI | 222530149 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00364968 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA TAAGAGACGG TGTTTATACA AACGATTATG CCATATTCTT CATGACAGAA GAAATTTTAA AGGACCTTGA CGAAGGGGTG CTCCAGCAGG CAAAAAACGC ATCCCAAATT CCAAATGTAG AATTTTTGGG CTATACACCA GATGCACACA TAGGCAAAGG TACTTCAATT GGCACAATAA TCGTTTGGGA CATGTCAAAG GCGTGGATTT CACCAACAAT TGTTGGTGTT GACATAGGTT GTGGTATGAG ACTGATTCTG ACAGACAAGT TTGCAGATGA TATAGATAAA GCACTTTTGA AGAAAATAAT GGATGAGGTA GAAGATTTGA TTCCAACAGG TGTTGGTAAG AAAAACAAAA AGATAGCTCT TTCCAAGACA AAGTATGAAG AGTATCTTCA AAATACAGAG ATTGATAAGG ACATTTCAGA CAAGATGGTT CTCATTCATG AGTTTGACCT TGACACAATA CCGGATGAGG CTCATGAGAT TGGTAAAGAG CAATTTGCAA CCTTGGGTGG AGGCAACCAC TTTATAGAGT TTCAAAAACT TCATGTCATA GATAAAATTA TTGCAGAAAA ATGGGGACTT TTCGATGGGC AGTTTGTTGT GATGATACAT TCTGGTTCGA GAAGGTTTGG AGCGGTCATT GGCGATTATT ATCAAAAGAA ATTTAAAGAC GTTATGAAAT CCAAGGGTAT CACTACGCCA GACCCGCAGC TTACCTTTTT GCCAATTGAC AACAAGGTTG CAAAAGATTA TATTAAAGCT ATGCAGTCAG CAGCTATTTA TGCAAAAATA AATAGACATT ATATGAGCAA CTTTATAATA TCAGTCTTAG AAAAACACTC AATTGACGCT TGGGTTTTAT ATGACGTTGC ACATAACATT GCATACATGG AAAGATTTGC AAACAGAGAA AAGCTTGTTA TAAGAAAAGG GGCAACAAGA GCATTACCGC CAAACCACTA TTTGATTCCG AATCCTAAAT TTGCTGAGAC AGGACATCCT GTGATTTTAC CTGGCAGTAT GGGTTCAAGT TCATATCTTA TGAGGGGAAT TGAGGACAAT ATAATAAGTT ATCATACAGT CAACCATGGA GCAGGCAGGG TTTTATCACG AACAAAGGCA AAAAAGACAA TTTCCATTGA AGAATTTTCA AAAGCTTTAA AACAGGGGCA AAGCGGAGAG ATTCTTATAA ACACTAAAAA CCTAAAAGAT TTTTTAGATG AAAGTCCACA GAGTTATAAA GACATTGAAC TTGTGATAAA TTCAGTAATT ACATCCAGGC TTGCTACTCC TGTTGCCAAA ATGGAGCCGC TTGGGGTCAT AAAAGGAAAA GATTAA
|
Protein sequence | MKKIRDGVYT NDYAIFFMTE EILKDLDEGV LQQAKNASQI PNVEFLGYTP DAHIGKGTSI GTIIVWDMSK AWISPTIVGV DIGCGMRLIL TDKFADDIDK ALLKKIMDEV EDLIPTGVGK KNKKIALSKT KYEEYLQNTE IDKDISDKMV LIHEFDLDTI PDEAHEIGKE QFATLGGGNH FIEFQKLHVI DKIIAEKWGL FDGQFVVMIH SGSRRFGAVI GDYYQKKFKD VMKSKGITTP DPQLTFLPID NKVAKDYIKA MQSAAIYAKI NRHYMSNFII SVLEKHSIDA WVLYDVAHNI AYMERFANRE KLVIRKGATR ALPPNHYLIP NPKFAETGHP VILPGSMGSS SYLMRGIEDN IISYHTVNHG AGRVLSRTKA KKTISIEEFS KALKQGQSGE ILINTKNLKD FLDESPQSYK DIELVINSVI TSRLATPVAK MEPLGVIKGK D
|
| |