Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_1863 |
Symbol | |
ID | 7408976 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1956352 |
End bp | 1957812 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 643716235 |
Product | hypothetical protein |
Protein accession | YP_002573724 |
Protein GI | 222529842 |
COG category | [S] Function unknown |
COG ID | [COG5542] Predicted integral membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000351046 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAACA AGATGTCAAA TAGAAAGTAT GAAGCAGTAA ACAAGCTTGA CGTTGCAAAT AAAAAAGTTG TAGCAATAAT AATATACAGT ATACTAATTA CAATATTGCC GTTAGGCTGG TGGATGGCTA ATTTCGGTGA AGTGATGTTT AAAAGCAGGC AGCCGAATAG AATAATAGTG AGCATTATAA CAATAAGTGG AATTATTTTT TACAGCTACT ATTTAAACTT TTACAACAAC AAATCGCAGA TTTCAAGAAC AGAATTCATA TGGATGCTAT TTACCGCATT TTTTGTAAGA TTGGCTTTAG GCGGTGCAAT ATCAGGACAT CCGATAGATG TAAATTGTTT CAAGACATGG ATGAACATTG CTTCACCTGA TATTTTCAAC ATTTACGAGA AGAACATTTT TATTGACTAC CTGCCAGGAT ATTTAATCAT CTTAGCATTA TTTAAATACA TTGCTTCGGT AGTCCACTGT ACAATTTCAG AGGACCTACT TATTAAATTA CCAAACATTT TAGCTGACAT TGCCATTGGA GTAATGGTGT TAGGCCTTAA GCAAAAGTGT GGAAGTCGAC CAACTCGTGA GGAAAGTACA ATAATTTTGT TTAATCCCGC ACTAATTTTG CTCTCATCAC TGTGGGGACA GAGTGACAGC TTTGTACTGA TGTTGCTGGT TATGCTATTT GTTGGTTTAG TTTACAATAC CTATATTTTA GCTGGATGTG TAACAGCCTA TTTATTATTC ACAAAACCTC AATTTATACT ATATTTTCCA CTAATAGCAC TATACTGGGT ATACAATCTT TGGCTTAGAA ATAGAACAGT GGACATAATA AAACAAATAA TTTCTTTTTG CGTTGTAAGC GGCGTTTTGT ATTTTCTTTT CATGCCTCAT AAGGAAATTT TATGGCTTCT AAAGTTCTTT ATAAAAATAG CAGGAGAATA TCCTTATTAC ACAGTAAATG CATTTAATAT CTATTATGCT GGAGGCTTAA ATTGGGTCAA AACAGGAGGA ATGTACGATT ACATAAATAT GATTGCTTTA GCAATTGCCT ACATTTTGGT TTTGGCGCAG ATTGGTGTTA GGTTGGAAAA ACCAAGAGAT CATTTAATCC AAGTGAGGTA TCTCCTGCAA GGTGGTTTTT TAGTTGGATT ACTTTCATAT AACCTTATGA CAGGTATGCA TGAAAGATAT TCGATATTTG CATGGTTATT TTGTCTTATG GTATATGTAC TGATTAATGA CATAAGATGG TTAAGGATAT CAATGATTAT TTCCTTGCTA ACCTTTCTGA ATATATCGAA GGTTTTGGAT CTATCTCTTT CAAACATTTA TTTTGTTCAA CGAGATCTTT CGAACTTTAT AGTGGCAATA GCGAATATTG TCATTCTTTT TGCATCACTT TATATTATTA CTAACAGCCT ATTAAAGCAC AAAAATAATG CAAAAGTATG A
|
Protein sequence | MKNKMSNRKY EAVNKLDVAN KKVVAIIIYS ILITILPLGW WMANFGEVMF KSRQPNRIIV SIITISGIIF YSYYLNFYNN KSQISRTEFI WMLFTAFFVR LALGGAISGH PIDVNCFKTW MNIASPDIFN IYEKNIFIDY LPGYLIILAL FKYIASVVHC TISEDLLIKL PNILADIAIG VMVLGLKQKC GSRPTREEST IILFNPALIL LSSLWGQSDS FVLMLLVMLF VGLVYNTYIL AGCVTAYLLF TKPQFILYFP LIALYWVYNL WLRNRTVDII KQIISFCVVS GVLYFLFMPH KEILWLLKFF IKIAGEYPYY TVNAFNIYYA GGLNWVKTGG MYDYINMIAL AIAYILVLAQ IGVRLEKPRD HLIQVRYLLQ GGFLVGLLSY NLMTGMHERY SIFAWLFCLM VYVLINDIRW LRISMIISLL TFLNISKVLD LSLSNIYFVQ RDLSNFIVAI ANIVILFASL YIITNSLLKH KNNAKV
|
| |