Gene Information |
Locus tag | Athe_1775 |
Symbol | |
ID | 7408562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | - |
Start bp | 1849673 |
End bp | 1850998 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643716152 |
Product | pyrimidine-nucleoside phosphorylase |
Protein accession | YP_002573641 |
Protein GI | 222529759 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02644] pyrimidine-nucleoside phosphorylase |
|
|
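The coordinate and length fields above are mutually consistent, which a few lines of arithmetic can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1849673
end_bp = 1850998

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1326
print(protein_length)  # 441
```

Both results match the Gene Length (1326 bp) and Protein Length (441 aa) fields listed in the table.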
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTGGTGA CAGAGATTAT TAGGAAAAAA CGTGACGGAG AGATACTGAG CAAAGAAGAG CTTGAATTTA TTGTAAACGG ATATGTAAAA GGCGAAATTC CAGACTATCA GATGTCTGCT TTCTTGATGG CTATATATTT TAGAGGGATG TCAAAAGATG AGCTTGTAGA ACTTACCATG CTCATGGCAA AATCAGGTAA GATGGTAGAC CTGAGCAGTA TAGAGGGTAT CAAGGTTGAT AAACACTCAA GTGGTGGTAT TGCTGATACA ACAACCCTAG TCTTAATACC ACTGGCAGCA TCTTGCGGTG TAAAGGTTGC AAAGATGTCA GGACGAGGGC TTTCTCACAC AGGTGGGACA ATTGACAAGT TGGAGTCAAT TCCAGGTTTT AAAACAGAGC TTTCAGAAGA TGAGTTTATA AAAGCAGTAA ATAAAGTTGG TGCGGCAATT GTTGGGCAGT CAGAAAGCCT TGTTCCTGCA GACAAAAAGA TATATGCTTT AAGAGATGTC ACAGGTACAG TTGAATCTAT ACCACTTATA GCATCTTCCA TCATGAGCAA AAAGATTGCA GCTGGTAGTG ATAAGATAAT ACTTGATGTC AAGTTTGGCA AAGGTGCTTT CATGAAAGAG TATGAAAAGG CAAAAGAGCT TGCTAATACT ATGGTTGAGA TTGGAACTTT GGCAGGAAGA GAAACAGTGG CATATGTTAC AGATATGAAT CAGCCACTTG GTCTTATGAT TGGCAACGCT CTTGAGGTTA TTGAAGCAAT AGAGGTCTTA AAAGGAAGAG GACATGAAGA TTTGAAGAAT CTTTGTATTG AATTTGCATC TGAGATGATG ATTATGGCTG GAGTTGAAAA GGAGAAAAAA TTAGCACAAG AAAGAGCAAT TGAGAGCATT GAAAAAGGGC ATGCTCTCAA AAAATTTAGA GAAATTATAA AAAACCAGGG TGGAAATCCT GAAATAGTTG ACAATTATTC ATTGCTGCCA CAAGCCAAAT ATATTTATGA ACTAAAATGC GACGAAGATA TGTATATTAA AGATATTGAT GCTCTCAAAC TTGGGCTTTG CGCACTAAAA CTTGGAGCAG GAAGACAAAG AAAAGAAGAC AAGATTGACT ACGCAGTTGG AATTCAACTT TTTGGTAAAA TAGGCGACAA AATAGCCAAG AATATGCCGT TTGCTAAAAT CTATGCAAAT GATGAAAAAA GGGTTGAAGA AGCCATCTCA GATGTGAAAA CTGCATTTGA GTTTTCAAAA GTACCTGTTC CAAAAAGAAA AGTAATATTT GCAAAGATAA CAAAAGATAA TGTTTTTGAA TTTTAA
|
Protein sequence | MLVTEIIRKK RDGEILSKEE LEFIVNGYVK GEIPDYQMSA FLMAIYFRGM SKDELVELTM LMAKSGKMVD LSSIEGIKVD KHSSGGIADT TTLVLIPLAA SCGVKVAKMS GRGLSHTGGT IDKLESIPGF KTELSEDEFI KAVNKVGAAI VGQSESLVPA DKKIYALRDV TGTVESIPLI ASSIMSKKIA AGSDKIILDV KFGKGAFMKE YEKAKELANT MVEIGTLAGR ETVAYVTDMN QPLGLMIGNA LEVIEAIEVL KGRGHEDLKN LCIEFASEMM IMAGVEKEKK LAQERAIESI EKGHALKKFR EIIKNQGGNP EIVDNYSLLP QAKYIYELKC DEDMYIKDID ALKLGLCALK LGAGRQRKED KIDYAVGIQL FGKIGDKIAK NMPFAKIYAN DEKRVEEAIS DVKTAFEFSK VPVPKRKVIF AKITKDNVFE F
|
| |
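The gene sequence above is listed in coding orientation (it begins with GTG and ends with the stop codon TAA), even though the gene lies on the minus strand of the replicon. The sketch below shows how the first ten codons map onto the first ten residues of the protein sequence, and how a GC fraction can be computed from a sequence string. It makes two labeled assumptions: amino-acid assignments follow translation table 11, which shares them with the standard code, and the initiating codon, GTG here, is read as methionine. The function names `clean`, `translate_cds`, and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
# Codon table in the conventional NCBI layout: bases ordered T, C, A, G.
# Translation table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spaces used to group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        # Assumption: the first codon is a valid start codon and is read as M.
        residues.append("M" if i == 0 else amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence from the record.
prefix = "GTGCTGGTGA CAGAGATTAT TAGGAAAAAA"
print(translate_cds(prefix))  # MLVTEIIRKK
```

The printed residues match the first block of the protein sequence (MLVTEIIRKK). Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field of the record.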