Gene Msed_1864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1864 
Symbol 
ID5104132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1808730 
End bp1809728 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content53% 
IMG OID640507750 
Productdiphthamide biosynthesis protein 
Protein accessionYP_001191928 
Protein GI146304612 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1736] Diphthamide synthase subunit DPH2 
TIGRFAM ID[TIGR00322] diphthamide biosynthesis protein 2-related domain 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.866923 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTTACA TTTTTGATGA AGACCTTCTC AAGTCCGAGA TCAGGAAGAG AGGGGCTAGG 
AGGGTTCTCC TTCAGTTCCC CGAGGGTTTA AGGTATTTCT CCACGGAGTT GGTGGAGAGG
TTAAGGGAAT CCCTTCCAGA CGTTGAGTTC GTGATATCGG GAGAACCGAG CTGGGGGGCC
TGTGACATAG CTGAAGACGA AGCCTCCCTT CTCAAGGTCG ACCTCCTCAT CCATTTCGGC
CACTCTCCTT ATACCTGGTA TTACCCCAGG TTTCCAACCC TCTTCGTTAA GGCTGAGAGC
ACGGCTCAAG TGGAGAGGGA GACCCTGGAC AAGCTAGTTG ATGTCCTTCG CGAGAGAGGA
GCTAACTCGG TCGCCCTAAC CTCGACCGTT CAGCACGGGA AACTACTGAA CCAGGTGAAG
GAGCACCTTT CCCCCCACTT CCACGTGGAG GTTGGAAGGC CTTCCTCACC TTTCATGGGG
GATGGACAGG TCCTGGGATG TGACTACAAG TCTGCCCAGG TTGAGGCCGA CGTGCACGTA
AACATCTCAG GTGGGGTTTT CCACGCCCTC GGACTGGGAC TGGCCACGGG TAAACCGGTC
ATCAAGCTTG ACCCCTACAC GAGATCTGTG GAGGACCTAA CTCCTCAGGT TTTCAAGGTC
CTCAAGGTGA GATATTCCAA GATCATGGAG GCCATGGACG CGAGGACCTG GGTCATTGTG
CAGGGATTGA AGGTTGGCCA GAACAGGCCC CTCATGGTTA AGTCCCTAGA GTCCAGGCTC
AAGTCCCTGG GGAAGACAAC CTACGTGGTC ACAAGCAAGG TTCTGAACCA GGACTCCCTC
AGAAACCTAG ATAGGAGCTA CATCGACGCC TTCGTGGTCA CATCGTGTCC AAGATTACCC
ACGGATGACC TCTACCTTTA CGAGAAGCCC GTGTTGACAC CTGGAGAGGC GAAAATGATT
ATAACCAATA AACTAGAACC ATACATATTT CCGTGGTAA
 
Protein sequence
MSYIFDEDLL KSEIRKRGAR RVLLQFPEGL RYFSTELVER LRESLPDVEF VISGEPSWGA 
CDIAEDEASL LKVDLLIHFG HSPYTWYYPR FPTLFVKAES TAQVERETLD KLVDVLRERG
ANSVALTSTV QHGKLLNQVK EHLSPHFHVE VGRPSSPFMG DGQVLGCDYK SAQVEADVHV
NISGGVFHAL GLGLATGKPV IKLDPYTRSV EDLTPQVFKV LKVRYSKIME AMDARTWVIV
QGLKVGQNRP LMVKSLESRL KSLGKTTYVV TSKVLNQDSL RNLDRSYIDA FVVTSCPRLP
TDDLYLYEKP VLTPGEAKMI ITNKLEPYIF PW