Gene MmarC5_1346 details


Gene Information

Locus tag: MmarC5_1346
Symbol:
ID: 4927907
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanococcus maripaludis C5
Kingdom: Archaea
Replicon accession: NC_009135
Strand:
Start bp: 1290720
End bp: 1292237
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 37%
IMG OID: 640166842
Product: thymidine phosphorylase
Protein accession: YP_001097858
Protein GI: 134046373
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0213] Thymidine phosphorylase
TIGRFAM ID: [TIGR02645] putative thymidine phosphorylase
            [TIGR03327] AMP phosphorylase
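
The coordinate and length fields above are related by simple arithmetic, which can be sanity-checked as sketched below. This is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length(1290720, 1292237)
print(length)                           # 1518
print(expected_protein_length(length))  # 505
```

Both results agree with the Gene Length (1518 bp) and Protein Length (505 aa) fields of the record.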


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTATTTT TAAACGCAAA ATTCATAGAC CTTGATCTAG GTGAAAATGC GGTAATTGTC 
AACGAAGAGG ACTTAAAAGG AACTTCGTAC TATCCGCAAG ATAGGGTATT GATTGAGTCA
CACGCAGGTT CCGTCATTGG AAACATTTAT TCCACAAAAA CTATGGTTAA CAAAGGCGAA
GTTGGAATGC TTGTTAGTGA ACTTGCAGAA ATTTCCATTT CAGAAGGGGA AGAAGTAAAA
TTAAGACACG CAGAAAAACC TGAATCAATT CCATTTATAA AAAAGAAGAT GGATGGCCAG
GTTTTAAATC CGCATGAAAT AAGAACGATA ATTGATGAAA TTGTATCTAA AAAACTCTCA
AACATTGAAT TATCTGCTTT TGTTTCGTCT ACGTACATAA ATGGAATGAA TATGGATGAA
ATAAGCGAAA TGACCAAGAG AATCGCAGAA ACTGGGGACA TGATTGCTTG GGAAAAGAGT
TTAGTTGTGG ATATTCACAG CATTGGTGGA GTTCCTGGAA ATAAGTACGC CTTACTTTCA
ATCCCGATAC TTGCAGCAGC AGGAATTACT GTTCCAAAAA CGTCATCAAG AGCAATAACT
TCACCTGCTG GAACCGCAGA TGTAATGGAA GTGCTCACAA ATGTTGAATT GAAAGAAGAA
GAGATTAAAA GAATAGTAAA AACTACAAAT GGATGTCTTG CATGGGGTGG AGGCGTAAAT
CTAGCTCCTG CAGATGATAT AATTATAAAT GTAGAACGGC CGGTTTCAAT AGACCCTCAA
CCACAGCTTC TTGCAAGTGT TATGGCAAAA AAGATTGCAA CAGGAATTAA ATATACTGTA
ATTGATATTC CCGTTGGAAA AGGGGTAAAA ATTAAAAATG AAGCAGAAGG GGCAAAATTA
GCAAGGAAAT TTATTGAACT TGGGGAATCG CTCAATATTA AGGTGGAATG TGTGTTAACT
TACGGTGGGC AGCCACTTGG GAGGGCAATT GGACCTGCAC TCGAAGCAAG AGAAGCAATC
GAAGCACTTC AAGATCCAAA AAATGCTCCA AAAAGTTTAA TTGAAAAGGC ACTATCTCTT
GCAGGAATTC TTCTTGAACT CGGAGGGGCT GCACAGATTG GGGAAGGTCA AAATTTAGCA
TGGGAAATTT TAGAATCCGG AAAAGCACTT GAAAAATTTA ATCAGATTAT AACTGAACAG
GGTGGAACTC CGAAAAAACC CGAAGAAATA GAACTTGGAG ATTATGTTGA AGAAATTCTT
GCGCCAATTG ATGGGTATAT TACAGATATA AGTAACACTG CAATTACAAA CGTGGTTAAA
GAAGCTGGAG CTCCAAGGGA TAAAAAAGCA GGAATTTTAT TGAATTCAAA GATTGGAAAT
AAAGTAAAAC AGGGAGATGT TTTATATACA ATCTATTCCG GATCAGAAGA AAGGCTTGTT
TCAGCAATAA ATCTTGCAAG AAGAGTTTAT CCTGTAAAGG TTGAAGGAAT GCTTATTGAG
AGAATAAGCA AATTCTAA
 
Protein sequence
MLFLNAKFID LDLGENAVIV NEEDLKGTSY YPQDRVLIES HAGSVIGNIY STKTMVNKGE 
VGMLVSELAE ISISEGEEVK LRHAEKPESI PFIKKKMDGQ VLNPHEIRTI IDEIVSKKLS
NIELSAFVSS TYINGMNMDE ISEMTKRIAE TGDMIAWEKS LVVDIHSIGG VPGNKYALLS
IPILAAAGIT VPKTSSRAIT SPAGTADVME VLTNVELKEE EIKRIVKTTN GCLAWGGGVN
LAPADDIIIN VERPVSIDPQ PQLLASVMAK KIATGIKYTV IDIPVGKGVK IKNEAEGAKL
ARKFIELGES LNIKVECVLT YGGQPLGRAI GPALEAREAI EALQDPKNAP KSLIEKALSL
AGILLELGGA AQIGEGQNLA WEILESGKAL EKFNQIITEQ GGTPKKPEEI ELGDYVEEIL
APIDGYITDI SNTAITNVVK EAGAPRDKKA GILLNSKIGN KVKQGDVLYT IYSGSEERLV
SAINLARRVY PVKVEGMLIE RISKF
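
The relationship between the gene sequence and the protein sequence listed above can be reproduced with a short script. The following is a minimal sketch, not the tool used by the database: the names `translate` and `gc_content` are hypothetical, the input is assumed to be the space-separated 10-base blocks shown on this page, and the codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in permitted start codons), so the standard assignments are used here.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (first base varies slowest).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block separators, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCTATTTT TAAACGCAAA ATTCATAGAC"))  # MLFLNAKFID
# Last two blocks of the gene sequence, ending in the TAA stop codon.
print(translate("AGAATAAGCA AATTCTAA"))               # RISKF
```

The two printed fragments match the start (MLFLNAKFID) and end (RISKF) of the protein sequence in the record. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.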