Gene Msed_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_2078 
Symbol 
ID5105058 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1996599 
End bp1997540 
Gene Length942 bp 
Protein Length313 aa 
Translation table11 
GC content50% 
IMG OID640507968 
Productbifunctional phosphoglucose/phosphomannose isomerase 
Protein accessionYP_001192142 
Protein GI146304826 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0166] Glucose-6-phosphate isomerase 
TIGRFAM ID[TIGR02128] bifunctional phosphoglucose/phosphomannose isomerase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.810924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAATG TTTTCCTAGA CTGGGATAAA CTTTTTCGAG AAGCTGAAAG GATAGAGGTT 
CCGGACCTCA AGTACGATAA CGTGATTTAC ACAGGGATGG GGGCAAGTTA CATCCCGGGT
GAAATGGCAA GGATACTTGA ACCTCCATTG GACTACCTCG TGTATAATGG GGATCCCACA
AGATTCAAGG CTAGGGGAAA GTTCTCTTTG CTAGCCTTTA GCAGGTCCGG GGACAATGTG
GAAACCTTGA TCGTAACCAG AAGGGCCTTT GAGCTTGGGG CGGACGTCAT ATGCGTTTCG
GCTGGAGGAA AGCTGGCAAA CCTGTGCAGA GAGAAGGGCG GAAGACACGT TAACTTGTCC
ATGCAGGCCA GGATGAGCAA CGGTCAAGAT TACCCCACGA GAGTCTGGTT CCCCCTCCTT
TTCACGGCCC TCGTTAAGAT TCTGAACACG AGGAGTAGCG GGCAATACAG GATTTCGGAG
CTGGCAGAGG GAGTGGAGGA AGGTAAGGAG AGGGCTCTTA ACCTTGCTAA GAGGTTGGTG
GCCAAGATTA GGGGAAGGAT CCCGGTCTTT TACGGCTCCC TATACTTTCC CGTAGCAATA
AGGTTCAAGC AGGATTTGAA TGAGACTGCC AAATATCCAG CCTTCTACGG GCCCATTCCT
GAATCGAATC ACAATGACCT AGAGGCATAC GTCAGGGCAC AGAGCCTTCA GCCCTTTGTG
ATTGGGGATC AGGACATTGA TTACGTAACG CTTTCCGTGA TTAAGGCTGA ACAGATAATT
CCTGCAGGGA GCACACCGCT GAAGAACGTG GCCTACTTGG TTCTTCTCTC AGGTCTCACG
TCCCTCCTGC TTGCAGAGGA AGAGGGATTA ACGGAAGAAG AGGCCTTCAG CGACAGCAAC
CTTAAAATTG CAAGGAAACT GGCAAACTTA ATCCTGAAGT GA
 
Protein sequence
MHNVFLDWDK LFREAERIEV PDLKYDNVIY TGMGASYIPG EMARILEPPL DYLVYNGDPT 
RFKARGKFSL LAFSRSGDNV ETLIVTRRAF ELGADVICVS AGGKLANLCR EKGGRHVNLS
MQARMSNGQD YPTRVWFPLL FTALVKILNT RSSGQYRISE LAEGVEEGKE RALNLAKRLV
AKIRGRIPVF YGSLYFPVAI RFKQDLNETA KYPAFYGPIP ESNHNDLEAY VRAQSLQPFV
IGDQDIDYVT LSVIKAEQII PAGSTPLKNV AYLVLLSGLT SLLLAEEEGL TEEEAFSDSN
LKIARKLANL ILK