Gene Information |
Locus tag | Athe_1573 |
Symbol | |
ID | 7409082 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1663095 |
End bp | 1666262 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643715944 |
Product | hypothetical protein |
Protein accession | YP_002573442 |
Protein GI | 222529560 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
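The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the reported gene length includes the terminal stop codon; neither convention is stated in the record, although both are consistent with its values. The function names are hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the one stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length_bp = gene_length(1663095, 1666262)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3168 1055
```

Under these assumptions the computed values match the Gene Length (3168 bp) and Protein Length (1055 aa) rows.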
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATATA AAAATAGACA CAGATTTTTA TTTTTGGTTA TCTGTGTTAT TACTCAACTT TTAATAATGA AAACTGTGAG TGCACAAACA AAAGAATATT TATCGATTGA TATGTACAAG CTTGCAGTGG AAAAGTTGTC TAAGATAGGT GTTGTATCAT CAAAAGATTA TCTCAAGCCA AACAGTTACG TGACTCGTCA GGAGTTTGTT AGGGCTATTG CAAAAATTTC AAATGTTGAT GAAAAAACTT TAGTTCAACA ACCATTTTCA TACTGTATTG ATGTAAAATC TAATACAGAA CTTTGTGGTT ATGTGAATTG GGCCGTTAAA AATAAGTATT TGAATATTTC AATAGATGGG AAATTTAGAC CATTTGAAGA AATTACATTT TCTCATGCAG TAACTGCAAT GGTAAAAATG CTAAATTATA CTGATTCAGA TATAGAGGGT ATCTGGCCTT ATAACTATAT CAGAAAGGCT TCTGAACTTG GACTGTTGAA AGGCTTAAAT GTGTCGGCTA AGAAAAAAGT TACAAAATAT GATTTGGCCA TTATGCTATA TAGGCTGCTT GAAACAAATG TAAAAAATAC AAATACAAAA TTTTCTGAAT ATGTTGGGCT TTACAAATCA TACGTTGTGC TTGATACTGG CAAGACATCA TCAAAGCTTC TTCCAAATGA GGTTTTGACA GACAGCAGAG TGCTTGTGAA TGCAACAAAG ACGCAACTTG AAGTTGGGAA AAAGTACATG CTTCAGGTTG ATACCAATAA AATCACGAAA GTGTTTGGCA CTGAAGCTGA TTCTTTCCAG ATTGTCAGCA CAAAGGTAAG CAGCAGGACT GTATATTACA AGGAAAGTGG AAAGACAAAA TCAATAACTT TGCCATCTTC GGCAACATAC TATTACAACG GCTCAAAGCA GAGCTATGAT GCAGTAGAAA ATGTGCTAAA ACCGAACCAG AAAATAAGCT TTATATATTC TGAAGATAGG AGCAAAATGG ACTTTGTTGT AATTTCCGAC ATATATGCAC AAGAGATTTA TGGAAACTAC GATGAGGTTT TGATTTTGGC AACTCCAAAA ACATCATCAG CACTGGATGC AAACCAGGTT CAGACAGACA AGGGAATATA CTTTGTTGCA TCTTCAATAA AACCTGAAAA CCTTGAAATT GGAGCAAAGT ATGGAGTGTA TATAAAAGAT GATACAATCA CTGCAGCTTT GCAAAAAGTG TGGGTATCAG AAAAGTTTAC AATTACAAAT ATAGATGATT ACACACTTGA TGCTGCTCAA AACGGCAAAA CACAAAGGAT TCAGCTGACA AGCAAACCTT TATATTACTA TCAGGGAACA AAACAGAGCT ATGAAAATTT ACCGAACATT TTAAAAGAAG ACCAGATACT CTATGTATCA AAAGACCCTG ACACAGGCAA GGTTATGGCA TATGTTATAC AAGACCCATA TGGTACACAG TATGGAAACT ATATTGAGGC AATAGTTTTG CAGGATGCAC TTTTAAACTC AAGCCTTGAG AACAACCAGG TTGTGACTGA CAAAGGTATT TTTTACCTTC CATCTGCCGA GACAAAACTT GAAATAGGTG CAAAGTACGG ACTTTATGTT AAAGACGATA AGATAACGCT TGTTGTAAGG AAGTTGAATA ATACTCAGCA ATATGAGGTA ACAGATGTTG TAGGTGATAC TAACGTCAAG CTAAAAGGTA CTCAGGGGCA GGAGAACATT ATTCTACCTC AAAAGCCGGT TTATTATTAC AATGGTGCAA AAACAAACTA CCTTGAGCTC 
AAGAATGTGC TAAAAGTGGG TCAGAAAATC TATTTTGGTT TTGCAAAGGA CGGCAAGACA TACGAATATG TTGTAATTCA GGACCCATAC TCTTTTGAGT ATGGCACATA CACAGAGGTC ATAGTGATGG CAGATAGCAT TTCGTCAAGC AAGCTTGCAA CAAACGAGGT TCTAACAGAC AAGGGCATAT ATGTTGTGGG AAGATCTGCA GGAAAGCTTT CTGTTGGTGC AAAGTATGGT GTGTACATCA AAAATGATAC AATCACCAAG GTTGTGAAAA AGCTCAACAA CGTGGAGTCA GCAGAGATTA CAGCAGTTGT GAGCGCTACA TCCGTGAAGC TCAAAAAAAG CAGCACTGAG AATGCGCTGC CTCTTCCTCA AAAACCTACG TACTATTACA ATGGAAATAA GCTCAGCTAT GACCAGCTAA AGAGTATTCT CAAAACAGGT CAGAAAATCT ATTTTGGCTA CAACCAGGCA GGAAATTCGT ATGAGTATGC AATCATCCAG GACCCATATT ATGATGAGTA TGGGACATAC ATGGAAGTTA TTGTAATGGG AACAAGCAAG GTAACAAAAG GTCTTGCAGA CAATGAAGTT CTAACAGACA AAGGCATTTT GACCCTGCCG TCAAACCAGA ATATAAACTT GGAGCTTGGC GCAAAATATG GGCTTTATGT TGATTCAGAT AATCAAATAA CTCTTGTCTA CAAAAAATTC AACTCAACAG ATAGCATGAC AGTTGTGTAT GCTCTTGGAA GCAAAGTAAC AGTTGACAAA GGCGGAACAC AGGTTGAGAT GACCTTGCCA CAAAACATAA TCTACTACTA CAGCGGCACA AAGATTGACT ACTCACAGGT GCTTCAGAAG ATGCAAAGGG CAACATCGCT TGTGTTTGGA ATATCAACAC AAAAGAGCGG GTATGACTAC TGTGTAATAT TTGACCCGGT ATACAGCAAA CCATACCTTG CAAATGAGCA GACATATCTG ACCTTAAAAG CAGGTGATTT GGATATAAGT GGTAGTAGCA AAGTTATAAA AGATGGGGAT GTTGTGGATT ACAGCTATAT TCAGAAAAAC GATGTTGTCT ATGCTGTGTC AGACATCTGG GGTGGTAATA AGTTCATACT TGTTGTAGAT GACAGGGTTG AATGTTACAT CAAAAGCTAC CAGCCAACAA GGTTCACACC AAAGTCAATT GTTGTAAGTG CAGTAGACCT GACAACAGGA AAGCTTGTGG ACAGAACATA TGAGGTGAGC GAAGATTTTG ACCCATCTGT GCTTCTTTCT GACACCTTCA AAGTTGGTCA GAGAGTGTAC CTCATTTTAG GGTACGATGG CAAGGTTGTG AGTATGGTGA AACCATAA
|
Protein sequence | MKYKNRHRFL FLVICVITQL LIMKTVSAQT KEYLSIDMYK LAVEKLSKIG VVSSKDYLKP NSYVTRQEFV RAIAKISNVD EKTLVQQPFS YCIDVKSNTE LCGYVNWAVK NKYLNISIDG KFRPFEEITF SHAVTAMVKM LNYTDSDIEG IWPYNYIRKA SELGLLKGLN VSAKKKVTKY DLAIMLYRLL ETNVKNTNTK FSEYVGLYKS YVVLDTGKTS SKLLPNEVLT DSRVLVNATK TQLEVGKKYM LQVDTNKITK VFGTEADSFQ IVSTKVSSRT VYYKESGKTK SITLPSSATY YYNGSKQSYD AVENVLKPNQ KISFIYSEDR SKMDFVVISD IYAQEIYGNY DEVLILATPK TSSALDANQV QTDKGIYFVA SSIKPENLEI GAKYGVYIKD DTITAALQKV WVSEKFTITN IDDYTLDAAQ NGKTQRIQLT SKPLYYYQGT KQSYENLPNI LKEDQILYVS KDPDTGKVMA YVIQDPYGTQ YGNYIEAIVL QDALLNSSLE NNQVVTDKGI FYLPSAETKL EIGAKYGLYV KDDKITLVVR KLNNTQQYEV TDVVGDTNVK LKGTQGQENI ILPQKPVYYY NGAKTNYLEL KNVLKVGQKI YFGFAKDGKT YEYVVIQDPY SFEYGTYTEV IVMADSISSS KLATNEVLTD KGIYVVGRSA GKLSVGAKYG VYIKNDTITK VVKKLNNVES AEITAVVSAT SVKLKKSSTE NALPLPQKPT YYYNGNKLSY DQLKSILKTG QKIYFGYNQA GNSYEYAIIQ DPYYDEYGTY MEVIVMGTSK VTKGLADNEV LTDKGILTLP SNQNINLELG AKYGLYVDSD NQITLVYKKF NSTDSMTVVY ALGSKVTVDK GGTQVEMTLP QNIIYYYSGT KIDYSQVLQK MQRATSLVFG ISTQKSGYDY CVIFDPVYSK PYLANEQTYL TLKAGDLDIS GSSKVIKDGD VVDYSYIQKN DVVYAVSDIW GGNKFILVVD DRVECYIKSY QPTRFTPKSI VVSAVDLTTG KLVDRTYEVS EDFDPSVLLS DTFKVGQRVY LILGYDGKVV SMVKP
|
| |
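The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be normalized before any computation. The following is a minimal sketch of one way to strip the block spacing and compute a GC percentage. It is not necessarily how the GC content row in this record was derived, and the function names are hypothetical.

```python
def normalize_sequence(raw: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase the bases."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# The first two 10-base blocks of the gene sequence in this record.
sample = normalize_sequence("ATGAAATATA AAAATAGACA")
print(len(sample), gc_percent(sample))  # 20 15.0
```

Passing the full Gene sequence field through `normalize_sequence` and then `gc_percent` gives a value that can be compared with the reported GC content.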