Gene Mbur_1436 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMbur_1436 
Symbol 
ID3998464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanococcoides burtonii DSM 6242 
KingdomArchaea 
Replicon accessionNC_007955 
Strand
Start bp1515894 
End bp1516889 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content47% 
IMG OID637959195 
Productdiphthamide biosynthesis protein 
Protein accessionYP_566096 
Protein GI91773404 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1736] Diphthamide synthase subunit DPH2 
TIGRFAM ID[TIGR00322] diphthamide biosynthesis protein 2-related domain 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCAGCA GTGAGCCTTT CAATTTTCAG ATAGACTATA TAATTGAGGT AATAAGGAAA 
GTCCAACCTG AGATGATCGG ACTGCAATTT CCAGAGGGTT TTAAGAGGCG AAGTCCTGCT
ATTGCTTCAG AGATAAGTGA AGCTACAGGA GTGGAAATAA TTATTTCTGC GAATCCCTGT
TATGGTGCTT GTGACCTCGA TATACCCATA CTTGAGAACG TTGACCTGCT TTTCCATTTC
GGACATGCAC AACTTGATGA TAACAGGTAT AGCGACAAGG TCATTTTTAT AGAGACACGC
TCCGATGCGG ATGTCAAGGG TGTTGTTGTC AAGGCCTTAG AGGAACTCCA GGGGAAGAAA
GTGGGACTTC TGACAACTGT ACAGCATGTC CATAAATTAC CGGAGGTTCG GGAGATACTT
GAGAACAACG GTAAGGTCGT AGCCATAGGA AGAGGCGATA GTAAGATAGC ATATGCAGGA
CAGGTACTTG GATGCAATTT CTCGGTTGCT GACGACCTGG ACTGTGAGGA GTTCCTGTAT
ATTGGCAGTG GAGGATTCCA TCCACTTGGT GTTTCGCTTG CCACTGGAAA GAGGGTACTT
ATTGCAGATC CTTTCTCAAA GGATGTGAGG GAGGTCGATC CTTCCCTTAT CATGCGTCAG
CGTAGTGCAG CAATTGCGAA GTCATTGGAT GCCGAGTCTT TCGGGATCAT TGTTTCCAGT
AAACCAGGGC AGTACAGAAT GGAACTTGCC AGGGAACTTA AAGGACTTGC CGAGGAGAAG
GGGAAGACCG CCTATATATT GATACTTGAT CTTATCACGC CTGACCAGAT GCTACAGTTC
AAAGTGGATG CTTTTGTGAG CACTGCATGT CCGCGGCTTG CTGTAGATGA GGTGGGAAGG
TTCTCCGCAC CAATGCTTAC CCCACAGGAA TTCGAGATCG TTCTCGGAAA ACGTGAGTGG
GAAGACATGG TCTTTGATGA GATAAGAGGA GACTGA
 
Protein sequence
MSSSEPFNFQ IDYIIEVIRK VQPEMIGLQF PEGFKRRSPA IASEISEATG VEIIISANPC 
YGACDLDIPI LENVDLLFHF GHAQLDDNRY SDKVIFIETR SDADVKGVVV KALEELQGKK
VGLLTTVQHV HKLPEVREIL ENNGKVVAIG RGDSKIAYAG QVLGCNFSVA DDLDCEEFLY
IGSGGFHPLG VSLATGKRVL IADPFSKDVR EVDPSLIMRQ RSAAIAKSLD AESFGIIVSS
KPGQYRMELA RELKGLAEEK GKTAYILILD LITPDQMLQF KVDAFVSTAC PRLAVDEVGR
FSAPMLTPQE FEIVLGKREW EDMVFDEIRG D