Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0988 |
Symbol | |
ID | 7407889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 1090400 |
End bp | 1091965 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643715353 |
Product | RNA binding metal dependent phosphohydrolase |
Protein accession | YP_002572862 |
Protein GI | 222528980 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.549745 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCAAAAGA TTAGTGAGAC ATTGAAGTTT GTAATTATTG GAGCAATTGT TTTGGTAGCA TCAATGGTTG CCTTTTTCTT GGGCTATTTA TATAGAAAGA AGATTGCAGA AAAGACCATA AAAAGTGCTG AACAAGAAGC CCAAAGAATT GTTGAGGAGG CCAAAAAACA GGCCGAGGCA TACAAAAAGG AGGCAACACT CCTTGCAAAA GAAGAGATAC ACAGAGCAAG GAGCGAATTT GACAGAGAGG TAAGAGAACG AAGAGCAGAG CTCCAGAGGT TCGAAAGAAG ACTTATCCAA AAAGAAGAGA TGCTTGACAA AAAGATGGCG TCTGTAGAGG AAAAAGAAGA GCAGCTCAAT CAGAAGCTGA AAGATATTCA AAAACTTCAA GAGGAGATTG AACTTTTAAA ACAAAAAGAG CAGGAAGAGC TGCAAAGAAT TTCTGGGCTC ACTCAGGAAG AAGCAAAACA AATTATACTC AAGAGTGTTG AACAAGATGT CAAACATGAC GTTGCGCTCA TGATAAAAGA GCTTGAACAG CAAGCAAAAG AAGAGGCTGA CAAAAAAGCC AGAGAAATTA TTGCTACTGC TATCCAACGC TATTCGTCAG ACTATGTTGC AGAAAACACT GTTTCTGTTG TGACACTCCC AAATGATGAA ATGAAAGGTA GAATCATAGG TAGAGAAGGA AGAAACATAA AGACATTTGA GACTGTCACA GGAATAGACC TTATAATTGA CGACACACCC GAGGCAGTAA TATTATCAGG GTTTGACCCG ATAAGGCGTG AGATAGCAAA ATTGACGCTT GAAAAGCTCA TTTTAGATGG GCGAATACAT CCTGCGCGAA TTGAAGAAAT GTACGAAAAA GCAAAACGAG AGGTTGAGAA TAAGATTCGA GAAGAAGGAG AAAGGGTTGT ATTTGAGCTT GGGATTCACA ACTTGCATCC AGAACTCATT AAGCTCATAG GAAAACTTAA GTACAGAACA AGCTACGGTC AAAATGTTCT TGCACATTCT ATTGAGGTAG CAAACATAGC AGGTATCATG GCAGCAGAGC TCGGTCTTGA CCAGAGCATT GCAAAGCGTG CAGGTCTTTT GCATGACATT GGCAAAGCAG TTGACCATGA AATGGAAGGG TCACATGCCC TGATTGGTTA TGAGCTTGCT AAAAAATACA AGGAGACAAA CCCGGATGTC CTTGAAGCGA TTGGTGGGCA TCACGGTGAG ATGGAAACAA GGTCAATTTA CAATGTGTTA ATTCAGGCTG CTGACTCTGT TTCAGCGGCA CGACCAGGAG CTCGAAGAGA ATCTCTTGAG TCTTATATCA AAAGACTTCA GAAACTTGAA GAGATTGCGA ATTCTTTTGA TGGTGTTGAA AAGGCTTATG CAATTCAAGC AGGAAGAGAG ATAAGAATAA TGGTAAAGCC TGACCATGTG AGTGATGATG ATATTGTTAT AATGGCAAGA GAGATAGTAA AGAGAATTGA AAGTGAGCTT GATTATCCAG GTCAGATAAA GGTAAATGTA ATCAGAGAAG TTCGAGCAGT TGAATATGCA AAATGA
|
Protein sequence | MQKISETLKF VIIGAIVLVA SMVAFFLGYL YRKKIAEKTI KSAEQEAQRI VEEAKKQAEA YKKEATLLAK EEIHRARSEF DREVRERRAE LQRFERRLIQ KEEMLDKKMA SVEEKEEQLN QKLKDIQKLQ EEIELLKQKE QEELQRISGL TQEEAKQIIL KSVEQDVKHD VALMIKELEQ QAKEEADKKA REIIATAIQR YSSDYVAENT VSVVTLPNDE MKGRIIGREG RNIKTFETVT GIDLIIDDTP EAVILSGFDP IRREIAKLTL EKLILDGRIH PARIEEMYEK AKREVENKIR EEGERVVFEL GIHNLHPELI KLIGKLKYRT SYGQNVLAHS IEVANIAGIM AAELGLDQSI AKRAGLLHDI GKAVDHEMEG SHALIGYELA KKYKETNPDV LEAIGGHHGE METRSIYNVL IQAADSVSAA RPGARRESLE SYIKRLQKLE EIANSFDGVE KAYAIQAGRE IRIMVKPDHV SDDDIVIMAR EIVKRIESEL DYPGQIKVNV IREVRAVEYA K
|
| |