Gene Information |
Locus tag | Athe_0746 |
Symbol | |
ID | 7408440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 834397 |
End bp | 835752 |
Gene Length | 1356 bp |
Protein Length | 451 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643715118 |
Product | hypothetical protein |
Protein accession | YP_002572634 |
Protein GI | 222528752 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
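The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 834397
end_bp = 835752
reported_gene_length_bp = 1356
reported_protein_length_aa = 451

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends in a single stop codon,
# which is not translated into an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print("gene length (bp):", gene_length_bp)
print("protein length (aa):", protein_length_aa)
print("matches record:",
      gene_length_bp == reported_gene_length_bp
      and protein_length_aa == reported_protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length fields.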
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000789691 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACA TCATAGAACT TGAGACACTT TTAAATTCAG CTCAAAACAA TCAAGAGTTT GAAGATTTGG ACAATATAGA GATAAAAGAA TTTCCTCAGC CTATGCCTGC AGAGGTATAT CGAGGAATGG TAGGTACTAT TGTAAAATAT CTTGAAAACT TTACTGAAGC AGCTCCTGAA GCTTTGCTTA TCAACTTATT GGTAGTATTT GGTGCCATTG TTGGGAAGGA AGCATGGATA GAGGTAGGAG GCGACAGGCA TTATCCAAAT CTCTTTGCTG TCTTGGTAGG AGATACTGCA AGTGGACGAA AAGGGTCAAG TTGGTCAATT ATTGAAAGGG TATTGGAGAA AGCTGACAAA AATTTTGTAT TAAACAATTT AAGAAATGGT ACAGTGTCAG GTGAGGGTAT TATATATCAT GTCAGAGACC CTATTTTCAA GTGGGACAAG AACTCTGAGA CTTATGAAAT GATAGACCCC GGCGTTGAGG ATAAGAGGTT ACTTATTATT GAGTCTGAGT TTGCCTCTCT ACTTAGGGTT ATGAAAAGAG AAGGGAACAC AATTTCTCCA TTGTTAAGGA ATGCATGGGA TGGCAAATAC AAATTAGAGA CACTCTCAAA AACAAATTAC ACAAAGGCAA CTAATGCTCA TATCTCATTG ATTGGGCATA TAACGTTTGA TGAACTGAAA AAAGAATTAT CAGACGTTGA GAAAATGAAC GGTTTTGGCA ACAGGTTTTT ATGGGTATGT ACACGAAGAA GCAAACTGTT ACCTAATCCA CCGTTATTAC CAGAGGACAA GCTTACAGGT TGGGGATTAT TATTAAGAGA GAGTATTTCA AAAGCACCAA AAGGTTTAAT TACAAAAACT CCTGCAGCTG AAGAAGCTTG GGCACTTATA TATGAAAAAT ACGCAGACAA GGGAGAAGGT GAGACAGCGG CTTTAATAGG CAGGGCAGAA GCACAGATTT TGAGATTAAG CCTAATATAT GCTTTATTAG ATGGGAGCGA GAAAATTACT CATGAACATA TATGCACTGC AAGGTTGGTG TGGGAGTACT GTCAAAAATC TGTTGAATTC ATTTTCAGTG AATTCAACAG AGAAAAAGAA AGCTCAATGG TTTTAAATTT ATTGAGCGCA CTAAAAGAAA AACCATTGAG CCAAAGCGAA ATTTATGAGG TTTTCAACAA ACATATCAAT GCCAAGAAAA TGGCTTATTT GCTAAAAAAG ATGAGTACAA AAGGTTATAT AGAAGCAAAG AAAGAAAGAA GCAACGGCCG ACCAAAAACA CGCTGGTACA TTACACCACT TGGTCTAAAG AAACTGGAAT CTTCTAATAT CGATTTTGCT TCTTAA
|
Protein sequence | MSNIIELETL LNSAQNNQEF EDLDNIEIKE FPQPMPAEVY RGMVGTIVKY LENFTEAAPE ALLINLLVVF GAIVGKEAWI EVGGDRHYPN LFAVLVGDTA SGRKGSSWSI IERVLEKADK NFVLNNLRNG TVSGEGIIYH VRDPIFKWDK NSETYEMIDP GVEDKRLLII ESEFASLLRV MKREGNTISP LLRNAWDGKY KLETLSKTNY TKATNAHISL IGHITFDELK KELSDVEKMN GFGNRFLWVC TRRSKLLPNP PLLPEDKLTG WGLLLRESIS KAPKGLITKT PAAEEAWALI YEKYADKGEG ETAALIGRAE AQILRLSLIY ALLDGSEKIT HEHICTARLV WEYCQKSVEF IFSEFNREKE SSMVLNLLSA LKEKPLSQSE IYEVFNKHIN AKKMAYLLKK MSTKGYIEAK KERSNGRPKT RWYITPLGLK KLESSNIDFA S
|
| |