Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mbar_A3188 |
Symbol | |
ID | 3627130 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methanosarcina barkeri str. Fusaro |
Kingdom | Archaea |
Replicon accession | NC_007355 |
Strand | - |
Start bp | 4099508 |
End bp | 4101028 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 637702027 |
Product | thymidine phosphorylase |
Protein accession | YP_306652 |
Protein GI | 73670637 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0213] Thymidine phosphorylase |
TIGRFAM ID | [TIGR02645] putative thymidine phosphorylase [TIGR03327] AMP phosphorylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0024826 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAATTGA AGTTAGAACA TTTTAACATA AAAATAGGGC AGCACAAGAT ATTACTAAAT ATTGCCGATG CAAAGGAACT GGGGGTTAAC CCAGGCGATA GGGTCCGTAT TCGTGGGCGT GAAAGTATTT CTGCAATTGC GGATACAACG GATGATATGG TTCCTCCAGG CACGCTGGGC GTTTTTTCCG AGGTATATGA GCACTTTGTG AACTGGGATA AACCGGTCGA AGTTGTTCCG GCATTCCGTT CTAAATCCGC ATCCGTGATC AAGAAAATGA TGGATAAAAA ACCTGTTGTG CAGGAAGAAA TTAAAACACT CGTAAACGAT ATAGTGGAAG AAAATCTCAG TGAAATCGAA CTTTCGGCAT TTATAACATC TTCTTATATT CACGGAATGA CCGATGATGA GGTCGAATGG CTTACAAGAG CTATGATTGA GAGCGGAGAC ACGATTGAGT TTGACACTCA TCCTATAATG GACAAACACT CGATAGGAGG AGTGCCCGGA AACAAGATCT CCCTCCTTGT TGTTCCTATT ATCGCTGCAA ACGGGCTTCT TATTCCGAAG ACGAGTTCAA GGGCGATTAC AGGGGCAGGT GGAACTGCTG ACCTTATGGA AGTGCTCTGT CCGGTAGAGT TCAGTTCCCA AGAAGTCAAA GAGATAACTG AAAAAGTCGG GGGCGCTCTT GTCTGGGGCG GAGCCACAAA TATAGCGCCT GCCGATGACA AGCTCATAAG GGTTGAATAC CCTCTCTCCA TTGATCCTTA CTACCAGATG CTCGCCTCGA TTATGGCAAA AAAAGGAGCT ATCGGGGCCG ACAATGTGGT AATGGACATT CCGGTTGGGC CGAGCACAAA AGTTCCAACT GTTCAGGAAG GGCAAAAACT TGCAAGAGAC CTGATTAACC TTGGGCACAG GCTTGGAATG AACGTTGAAT GTGCAATCAC CTATGGTTCG TCTCCTATTG GAAGAAAAGT AGGACCTTCA CTGGAAGTCA GGGAAGCTCT GAAGGTACTG GAAAGTATGG AAGGTCCGAA CAGCCTTATT GAAAAGAGTG CGGCTCTGGC AGGTATCCTG CTTGAGATGG GGGGTGCGGC TCCAAGGGAC CGTGGAAAAG AGATTGCACT GGAAACACTA AGGAGCGGAA AAGCCCTTGA GAAGATGAAA CAGATTATTG AAGCCCAGGG CGGTGATCCG AAGATTACCT CGGCTGACAT CCAGGTAGGG CAATATACTG CCGATATTCT CGCTTCTGCG GACGGATATG TCATCGAGTT TGACAATAAG TGGATAATTG AAATTGCCAG GCTGGCAGGA GCTCCTAATG ATAAAGGAGC CGGGGTCGCT ATTCACAAGA AAATGGGAGA ATCCGTTAAG AAGGGAGATC CTATCCTTAC GATCTATGCT GAAAAAGAGT TCAAACTAGA GACCGCATTG GCAACAGCCC AGAGAACAAA CCCGATAGTT GTTGAGGGCA TGCTTCTTAA GAGAATTCCC GGAACCTACG GGTTCCAGTA A
|
Protein sequence | MQLKLEHFNI KIGQHKILLN IADAKELGVN PGDRVRIRGR ESISAIADTT DDMVPPGTLG VFSEVYEHFV NWDKPVEVVP AFRSKSASVI KKMMDKKPVV QEEIKTLVND IVEENLSEIE LSAFITSSYI HGMTDDEVEW LTRAMIESGD TIEFDTHPIM DKHSIGGVPG NKISLLVVPI IAANGLLIPK TSSRAITGAG GTADLMEVLC PVEFSSQEVK EITEKVGGAL VWGGATNIAP ADDKLIRVEY PLSIDPYYQM LASIMAKKGA IGADNVVMDI PVGPSTKVPT VQEGQKLARD LINLGHRLGM NVECAITYGS SPIGRKVGPS LEVREALKVL ESMEGPNSLI EKSAALAGIL LEMGGAAPRD RGKEIALETL RSGKALEKMK QIIEAQGGDP KITSADIQVG QYTADILASA DGYVIEFDNK WIIEIARLAG APNDKGAGVA IHKKMGESVK KGDPILTIYA EKEFKLETAL ATAQRTNPIV VEGMLLKRIP GTYGFQ
|
| |