Gene Mbar_A3188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbar_A3188 
Symbol 
ID3627130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosarcina barkeri str. Fusaro 
KingdomArchaea 
Replicon accessionNC_007355 
Strand
Start bp4099508 
End bp4101028 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content47% 
IMG OID637702027 
Productthymidine phosphorylase 
Protein accessionYP_306652 
Protein GI73670637 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0213] Thymidine phosphorylase 
TIGRFAM ID[TIGR02645] putative thymidine phosphorylase
[TIGR03327] AMP phosphorylase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0024826 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCAATTGA AGTTAGAACA TTTTAACATA AAAATAGGGC AGCACAAGAT ATTACTAAAT 
ATTGCCGATG CAAAGGAACT GGGGGTTAAC CCAGGCGATA GGGTCCGTAT TCGTGGGCGT
GAAAGTATTT CTGCAATTGC GGATACAACG GATGATATGG TTCCTCCAGG CACGCTGGGC
GTTTTTTCCG AGGTATATGA GCACTTTGTG AACTGGGATA AACCGGTCGA AGTTGTTCCG
GCATTCCGTT CTAAATCCGC ATCCGTGATC AAGAAAATGA TGGATAAAAA ACCTGTTGTG
CAGGAAGAAA TTAAAACACT CGTAAACGAT ATAGTGGAAG AAAATCTCAG TGAAATCGAA
CTTTCGGCAT TTATAACATC TTCTTATATT CACGGAATGA CCGATGATGA GGTCGAATGG
CTTACAAGAG CTATGATTGA GAGCGGAGAC ACGATTGAGT TTGACACTCA TCCTATAATG
GACAAACACT CGATAGGAGG AGTGCCCGGA AACAAGATCT CCCTCCTTGT TGTTCCTATT
ATCGCTGCAA ACGGGCTTCT TATTCCGAAG ACGAGTTCAA GGGCGATTAC AGGGGCAGGT
GGAACTGCTG ACCTTATGGA AGTGCTCTGT CCGGTAGAGT TCAGTTCCCA AGAAGTCAAA
GAGATAACTG AAAAAGTCGG GGGCGCTCTT GTCTGGGGCG GAGCCACAAA TATAGCGCCT
GCCGATGACA AGCTCATAAG GGTTGAATAC CCTCTCTCCA TTGATCCTTA CTACCAGATG
CTCGCCTCGA TTATGGCAAA AAAAGGAGCT ATCGGGGCCG ACAATGTGGT AATGGACATT
CCGGTTGGGC CGAGCACAAA AGTTCCAACT GTTCAGGAAG GGCAAAAACT TGCAAGAGAC
CTGATTAACC TTGGGCACAG GCTTGGAATG AACGTTGAAT GTGCAATCAC CTATGGTTCG
TCTCCTATTG GAAGAAAAGT AGGACCTTCA CTGGAAGTCA GGGAAGCTCT GAAGGTACTG
GAAAGTATGG AAGGTCCGAA CAGCCTTATT GAAAAGAGTG CGGCTCTGGC AGGTATCCTG
CTTGAGATGG GGGGTGCGGC TCCAAGGGAC CGTGGAAAAG AGATTGCACT GGAAACACTA
AGGAGCGGAA AAGCCCTTGA GAAGATGAAA CAGATTATTG AAGCCCAGGG CGGTGATCCG
AAGATTACCT CGGCTGACAT CCAGGTAGGG CAATATACTG CCGATATTCT CGCTTCTGCG
GACGGATATG TCATCGAGTT TGACAATAAG TGGATAATTG AAATTGCCAG GCTGGCAGGA
GCTCCTAATG ATAAAGGAGC CGGGGTCGCT ATTCACAAGA AAATGGGAGA ATCCGTTAAG
AAGGGAGATC CTATCCTTAC GATCTATGCT GAAAAAGAGT TCAAACTAGA GACCGCATTG
GCAACAGCCC AGAGAACAAA CCCGATAGTT GTTGAGGGCA TGCTTCTTAA GAGAATTCCC
GGAACCTACG GGTTCCAGTA A
 
Protein sequence
MQLKLEHFNI KIGQHKILLN IADAKELGVN PGDRVRIRGR ESISAIADTT DDMVPPGTLG 
VFSEVYEHFV NWDKPVEVVP AFRSKSASVI KKMMDKKPVV QEEIKTLVND IVEENLSEIE
LSAFITSSYI HGMTDDEVEW LTRAMIESGD TIEFDTHPIM DKHSIGGVPG NKISLLVVPI
IAANGLLIPK TSSRAITGAG GTADLMEVLC PVEFSSQEVK EITEKVGGAL VWGGATNIAP
ADDKLIRVEY PLSIDPYYQM LASIMAKKGA IGADNVVMDI PVGPSTKVPT VQEGQKLARD
LINLGHRLGM NVECAITYGS SPIGRKVGPS LEVREALKVL ESMEGPNSLI EKSAALAGIL
LEMGGAAPRD RGKEIALETL RSGKALEKMK QIIEAQGGDP KITSADIQVG QYTADILASA
DGYVIEFDNK WIIEIARLAG APNDKGAGVA IHKKMGESVK KGDPILTIYA EKEFKLETAL
ATAQRTNPIV VEGMLLKRIP GTYGFQ