Gene Mlab_0067 details


Gene Information

Locus tag: Mlab_0067
Symbol:
ID: 4795851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanocorpusculum labreanum Z
Kingdom: Archaea
Replicon accession: NC_008942
Strand:
Start bp: 70267
End bp: 71784
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 53%
IMG OID: 640098712
Product: thymidine phosphorylase
Protein accession: YP_001029512
Protein GI: 124484896
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02645] putative thymidine phosphorylase; [TIGR03327] AMP phosphorylase
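
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the unspliced CDS includes its stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(70267, 71784)
print(length)                     # 1518
print(protein_length_aa(length))  # 505
```

Under these assumptions the computed values agree with the Gene Length (1518 bp) and Protein Length (505 aa) listed in the record.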


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0000469248
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAACCTTA TTCTTGATAT GATCGACATC TCTACACCCG TTATTCTTCT TAACGACACC 
GATGCCCGAC AGATAGGCGT ACTTGAAGGC GATCGTGTCA CCATTACCCG GATAAAAACA
AAACATACCA TTGCGGCTCC AGTATCTATC ACAAAAACAC TCACGAGTCA GGGAACTGCC
ACCATCTCCC TTGGAACCAA TGAAAATCTT GCCGCAGAAA AAGGTGACGA AATAGAAATT
CGTGCCGCTC CACGTCCGGC ATCAATCGCC TTCATCCGGA AAAAAATGGA CGGAGGTAAA
TTTACCCGGG AAGAAACGGC CACAATCATC TCCGACATGT CATCGAACGT TCTCTCCCCC
TCAGAAATCA CCGCCTACAT CACTGCCGCC TACATCAACG GACTCGACAT GGATGAAGTA
GAATTCCTCA CCCGGGAAAT GGTTGCCTCA GGAGAGCAGA TCACCTTCTC GAAAAAACCC
GTCGTTGACA AACACTCGAT AGGAGGCGTT CCCGGAAACA AAATCACACT CCTTGTCGTC
CCTGTGATCG CCGCATCCGG TCTTCTCATT CCAAAAACCA GCTCGCGGGC AATCACCGGA
GCGGGAGGAA CGGCTGATCT TATGGAAGCC CTTGCTCCCG TTGCCTTCTC GGCTGCTGAA
ATCAAAACCA TGACCGAAAA GGCCGGTGGC GTTATCGTTT GGGGCGGTGC AACAAACATC
GCCCCCGCCG ACGATATGAT CGTCACCTAT GAGTATCCTC TGAAAATCGA TGCACGAGGT
CAGATGCTTG CAAGCATCAT GGCAAAGAAA ATGGCAGTCG GCTCCGACAC CTGTGTCATC
GATATTCCGA TCGGTCCCGG TACAAAAATC CCCGATGAAG CAGAAGGCAG GGTACTTGCC
AACGAACTCA TCACCCTTGG GAATCGTCTA GGGATCAGAG TAGAATGCGC CGTCACCTTC
GGCGGCTCTC CCATCGGCAG AAACATCGGC GTCAACCTCG AAGTCTCCGA AGCTCTCAGC
CTCCTCGAAG GAAAACGGGG TGCCAACTCC CTTGTCCAGA AAAGCGTCGC CATTGCAGGA
ATCGCGCTTG AAATGACCGG TAAAACCGGG GCCGATTCGG GAGCGGAAGC TGCATATGAC
ATCATCAAAA AAGGAAAAGC CCTCAAAAAA ATGCTCGACA TCATCGAGAT CCAGGGAGGT
GACCCCAAAG TCAAATCGAC CGACTTCCCC GTTGGCGAAC ACACCTTTGT TGTACCTGCA
GCTTCAGACG GCTACGTGGT CTCCGTTAAA AATCAGGCCC TCATCAGCAT TGCCCGGGCA
GCCGGATCCC CGGTAGATCA CGGCGCAGGT CTCCATCTCC ACAAAAAACC CGGAGAATAC
GTCAAACGCG GAGAGCCGCT TCTTACCATC TACGCCGAAC GCGGATGGCG TCTTACCCGG
GCCATCGAAG AAGCAAGAAC CTCCTACCCC GTCCTTGTGG AAGGTATGCT TCTCGAACGC
ATCTCGAGCA ACCGATGA
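
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be normalized before its length or GC content can be checked against the record. The following is a minimal sketch of one way to do that; the helper names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def normalize_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "ATGAACCTTA TTCTTGATAT GATCGACATC TCTACACCCG TTATTCTTCT TAACGACACC"
seq = normalize_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text to the same two functions would allow a comparison with the Gene Length and GC content fields of the record.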
 
Protein sequence
MNLILDMIDI STPVILLNDT DARQIGVLEG DRVTITRIKT KHTIAAPVSI TKTLTSQGTA 
TISLGTNENL AAEKGDEIEI RAAPRPASIA FIRKKMDGGK FTREETATII SDMSSNVLSP
SEITAYITAA YINGLDMDEV EFLTREMVAS GEQITFSKKP VVDKHSIGGV PGNKITLLVV
PVIAASGLLI PKTSSRAITG AGGTADLMEA LAPVAFSAAE IKTMTEKAGG VIVWGGATNI
APADDMIVTY EYPLKIDARG QMLASIMAKK MAVGSDTCVI DIPIGPGTKI PDEAEGRVLA
NELITLGNRL GIRVECAVTF GGSPIGRNIG VNLEVSEALS LLEGKRGANS LVQKSVAIAG
IALEMTGKTG ADSGAEAAYD IIKKGKALKK MLDIIEIQGG DPKVKSTDFP VGEHTFVVPA
ASDGYVVSVK NQALISIARA AGSPVDHGAG LHLHKKPGEY VKRGEPLLTI YAERGWRLTR
AIEEARTSYP VLVEGMLLER ISSNR
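
The protein sequence can be cross-checked against the gene sequence by translating codons. The following is a minimal sketch in plain Python; it assumes the standard codon assignments, which translation table 11 shares with table 1 (the two differ in which start codons are permitted, which this sketch does not model). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Assumption: standard assignments, as used by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence in the record.
print(translate("ATGAACCTTA TTCTTGATAT GATCGACATC"))  # MNLILDMIDI
print(translate("ATCTCGAGCA ACCGATGA"))               # ISSNR
```

Both fragments agree with the start (MNLILDMIDI) and end (ISSNR) of the protein sequence listed in the record.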