Gene Msed_1810 details

Gene Information

Locus tag: Msed_1810
Symbol:
ID: 5105373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1752330
End bp: 1753538
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 51%
IMG OID: 640507709
Product: UDP-glucose 6-dehydrogenase
Protein accession: YP_001191888
Protein GI: 146304572
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1004] Predicted UDP-glucose 6-dehydrogenase
TIGRFAM ID: [TIGR03026] nucleotide sugar dehydrogenase
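
The coordinate and length fields above are consistent with each other. A minimal Python sketch of the arithmetic follows. It assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Start bp and End bp fields of the record.
start_bp = 1752330
end_bp = 1753538

def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1

def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (an assumption)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1209
print(protein_length(length_bp))  # 402
```

Both results match the Gene Length and Protein Length fields.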


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGGATAG GGATTGTGGG ACTAGGAGTC GTGGGTTTAT CCACAGGAGT TGTGCTCGCT 
GAGCAGGGAC ATGAAATAGT TGGAGTCGAT ATTGATCAGG GGAGAGTTCA TGGACTTCAG
TGTAGGAGGC CTCCCATATA TGAGCCCGGT CTGGAGGAGG CACTGAACAG GAACTTTGAG
AGGATGAAGT TTTCAACCGA CTACTCGTCG CTCTCGGACG CTGAGGTCAT CTTTATCACT
GTTTCTACCC CAACCGTTGA TGGAAGGATA TACCTTGGCT ACGTGTTCGA CGCCGCGAGG
AAGATAAAGG AAGTGGCGAG GGGCGTGATT GCCATAAAGA GCACCGTAGT CCCAGGGACT
GCAAGGAAGG TCAAGGAAAT CACGAACCTT AAGGTTGTAT CCAACCCAGA GTTCTTGAGG
GAGGGAAACG CCCTGCACGA CACTGAAAGC CCGGACAGGG TTGTGATAGG TAGTGACGAC
AAGGAGGCAG GCGACCTGAT CCAGGAGCTG TGGTCCTTCA CGAAAGCCCC AGTGATCAGA
ACCACCACAG ACGAGGCTGA GCTAATAAAG TACGCTGCGA ACTCCTTCCT AGCAGTTAAG
GTGTCCTTCA TAAACGAAAT AGCTAACTTA TGCGAAAGGC TAAACTGCGA CGTTAACGTT
ATAGCCAGGG CAATTGGACT TGATAAGAGG ATCTCCCCTT ATTTCCTGTC AGCAGGACTA
GGCTGGGGAG GATCGTGTTT CCCGAAGGAT ACCCTGGCGA TAACCTCGTT CGCGAGAGAC
GTTGGGGAGA AGCTCAGAAT AGTTGAGGCA GCCATAGAGG TTAACCAGGA AAGACCCTTC
AGGGCTCTCA AGCTTCTTAA GGAAGTCATG GGCGAGGTTA GGGGTAAAAC TGTTTGTGTC
CTAGGACTGG CCTTCAAGCC TAATACCGAT GACACCAGAG AGAGCGTGGC ACTCAAGGTG
GTTAACCTGC TGAGACAGGA GGGAGCCATG GTTATAGCCT ACGATCCAAA GGCTAGATCT
GACGTCGAAA TGGTGACTCT TGACGAGTGC ATAACTCGGG CAGATGGTGT CATCATTGCT
ACTGAGTGGG ACGAGTTCAG GGGACTTGAG CCCAAGTTAA GGGGTAAACC AGTGGTTGAC
GGCAGAAGAG TTCTAGATCC AGCGAAGATG GGACAGGAAT TTAGGGCCAT TGGGCTTGGT
GTTAGGTGA
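
The gene sequence is displayed in blocks of 10 bases. The sketch below shows one way to strip that layout and compute length and GC content. The helper names are hypothetical, and the example input is only the first displayed line, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()

def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# First displayed line of the gene sequence only.
first_line = "TTGAGGATAG GGATTGTGGG ACTAGGAGTC GTGGGTTTAT CCACAGGAGT TGTGCTCGCT"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # 51.7
```

Applying the same functions to every line of the gene sequence should give values that can be compared with the Gene Length and GC content fields.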
 
Protein sequence
MRIGIVGLGV VGLSTGVVLA EQGHEIVGVD IDQGRVHGLQ CRRPPIYEPG LEEALNRNFE 
RMKFSTDYSS LSDAEVIFIT VSTPTVDGRI YLGYVFDAAR KIKEVARGVI AIKSTVVPGT
ARKVKEITNL KVVSNPEFLR EGNALHDTES PDRVVIGSDD KEAGDLIQEL WSFTKAPVIR
TTTDEAELIK YAANSFLAVK VSFINEIANL CERLNCDVNV IARAIGLDKR ISPYFLSAGL
GWGGSCFPKD TLAITSFARD VGEKLRIVEA AIEVNQERPF RALKLLKEVM GEVRGKTVCV
LGLAFKPNTD DTRESVALKV VNLLRQEGAM VIAYDPKARS DVEMVTLDEC ITRADGVIIA
TEWDEFRGLE PKLRGKPVVD GRRVLDPAKM GQEFRAIGLG VR
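
The protein begins with M although the gene begins with TTG. Under translation table 11, TTG can serve as a start codon, and an initiator codon is read as methionine. The following sketch translates the first ten codons under that rule. It builds the codon table from the standard TCAG-ordered amino acid string. The start-codon set is limited to ATG, GTG and TTG as a simplifying assumption, and `translate_cds` is an illustrative name rather than a function from any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a simplified subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}

def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as M and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)

# First 30 bases of the gene sequence.
print(translate_cds("TTGAGGATAG GGATTGTGGG ACTAGGAGTC"))  # MRIGIVGLGV
```

The output matches the first ten residues of the protein sequence.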