Gene Information |
Locus tag | Athe_0535 |
Symbol | |
ID | 7408660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 605538 |
End bp | 606566 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643714917 |
Product | 3D domain protein |
Protein accession | YP_002572434 |
Protein GI | 222528552 |
COG category | [S] Function unknown |
COG ID | [COG3583] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
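
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (the gene sequence in this record ends in TAA). The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
start_bp = 605538
end_bp = 606566
gene_length_bp = 1029
protein_length_aa = 342


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming the final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)
print(computed_length, computed_protein)
```

Under these assumptions the computed values agree with the listed gene length and protein length.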
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000418011 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAT TTAAGTGCCT TGTGGCAAAG CCAAGGGATA TGAAAAAGCT TATCCTGGCT TTTGTCATTG TATTTGTTTT GTCAGTCCTG CTTGGCGCCA TGACTGCACA GGCACTGGTG AAAGAAGTTA GCATAACAAT TGACGGCAAG ACATTTTATT ATAAAACGAT TAAATCCACA GTAAGAGAGG TTTTAGAAGA AAATCAAATT TACTTGACAA AAGATGACTA TGTCTCGCCT TCTTTGGATT CAAAAATAAA TGAAAATACC CAGATAATAA TAAAAAGAGC TTTTGAAGTG AAAATACTTG TTGGCGACGA GGAAAAAGTT GTATATATTC CAAGCGGTAC TGTTGAGGAT GCTATCAAAA AAGCTGGAGT TGTTCTTGGA AAGTTGGACA AGATAAATCT TCCTCTCTCT CAGCTTCTTG ATAAGTCAAC TGTCATTAAA ATTACTAAGG TGACAGAGAA GGTGGTCGTA GAAAAACAAA AAATACCTTT CAGTACAGTG ACAAAAATAA ACTATAATAT GGACTACGGA AAGCAAAAGG TTATCCAGCA AGGGCAGGAT GGTATTAAAG AAAGAAGATA CAAAGTTGTC TTGGAAGATG GTAAAGAAGT TGAGAGAAAG TTGATTGAGG AAAGAGTTGT CAAAAATTCG AAGCCGAGGA TTGTTGAAGT TGGAGCAATA AGGTGGTTCA AGACATCAAG AGGAGAAGTG GTCAGATACA GAAAAGTTTA TACAATGATA GCAACTGCAT ATTCTTTGAC CCCAAGTGAT ACAGGAAAAA GTCCATCTCA TCCTGATTAT GGCAGAACTG CAACAGGTCA CAAAGTAAAG CGCGGGGTTG TTGCGGTTGA CCCGCGCGTG ATTCCGCTTG GAACAAGGCT TTATATAGAA GGATATGGTT TTGCGAGAGC TCTTGATACA GGTTCTGCTA TCAAGGGAAA CAGGATAGAT GTGTTTGTTG AAAAGGATGC GTATAAATTT GGTGTGCGGC GCGTAAAAGT TTATGTGCTT GCAGACTAA
|
Protein sequence | MNKFKCLVAK PRDMKKLILA FVIVFVLSVL LGAMTAQALV KEVSITIDGK TFYYKTIKST VREVLEENQI YLTKDDYVSP SLDSKINENT QIIIKRAFEV KILVGDEEKV VYIPSGTVED AIKKAGVVLG KLDKINLPLS QLLDKSTVIK ITKVTEKVVV EKQKIPFSTV TKINYNMDYG KQKVIQQGQD GIKERRYKVV LEDGKEVERK LIEERVVKNS KPRIVEVGAI RWFKTSRGEV VRYRKVYTMI ATAYSLTPSD TGKSPSHPDY GRTATGHKVK RGVVAVDPRV IPLGTRLYIE GYGFARALDT GSAIKGNRID VFVEKDAYKF GVRRVKVYVL AD
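
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the sequences are stored in space-separated blocks of ten characters, as shown above, and uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs from the standard table in its permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first and last blocks of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon table built in TCAG order; amino-acid assignments are those of the
# standard genetic code, which table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final nine bases of the gene sequence above.
first_blocks = "ATGAATAAAT TTAAGTGCCT TGTGGCAAAG"
last_bases = "GCAGACTAA"

print(translate(first_blocks))  # first ten residues of the protein
print(translate(last_bases))    # last two residues, then the stop codon
print(round(gc_fraction(first_blocks), 3))
```

Passing the full gene sequence to `gc_fraction` and `translate` in the same way would allow the listed GC content and protein sequence to be checked against the nucleotide sequence.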
|
| |