Gene Mpal_2131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2131 
Symbol 
ID7271611 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2262962 
End bp2264215 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content55% 
IMG OID643570745 
Productnucleotide sugar dehydrogenase 
Protein accessionYP_002467152 
Protein GI219852720 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0677] UDP-N-acetyl-D-mannosaminuronate dehydrogenase 
TIGRFAM ID[TIGR03026] nucleotide sugar dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCAGA AATCAGTAAC TGATAAGACC GTCTGTGTAG TCGGCCTCGG GTATGTGGGC 
TATCCACTGG CAGAAGCCTT CTCGCATCAT CTGAAGACGA TTGGATTTGA TATCGACGCA
CGGAAGATCG GGATCATCAA CGAGACCCCT GGGAACAAGG TCCAGGCAAC CACTGATCCC
TCTCATATTG GGGAGGCTGA CATCATCATC ATTGCCGTGC CGACACCGGT GACCAAGGCC
AAGGATCCTG ACCTGATGCC GGTGATTTCA GCGGCTGAGA CTATTGGGAA ACATATCAAG
AAGGGTGCCA TCGTGGTCCT CGAATCCACG GTCTACCCTG GGGTGACTGA GGAACTCATG
GCCCCGATGA TCGAGAAGAT GTCTGGGTTT ACCTGCGGAA AGGAGTTCAA GATCGGGTAC
TCACCTGAGC GGATCAACCC TGGTGACGAG GAACACATCC TTCCAAAGAT CACGAAGATC
GTCTCAGGGA TGGATGAGGA GACCCTCGAA ACACTCGCCT CGCTGTACAG CCTCGTGACC
ACGGTCTACC GTGCAGAGAA CATCCGGACG GCTGAGGCTG CCAAGGTGAT CGAAAACATC
CAGCGTGACC TGAACATCGC CCTGATGAAC GAGCTCTCGC TGATCTTCCA GAAGATGGGT
CTCGACACCC AGGCTGTCCT CGAGGCGGCG GGAACCAAGT GGAACTTCCA CCATTACCGT
CCAGGTCTGG TCGGCGGTCA CTGTATTCCG GTCGACCCCT ACTACCTCGT CTACAAGGCA
GAGGAGCTCG GGTACCACCC TCAGGTGATC CTCGCCGGCC GTGCGATCAA CGACTTCATG
CCCAAGCATG TCGCAGAACT GGCTATCAAG GGGATGAACG ACGCCGGCAA GGTGATCCGG
GACTCGAAGG TGCTGATCCT CGGGCTCACC TACAAGGAGA ATGTGCCTGA CACCCGGGAG
AGCCCGTCCC ATGAGATGAT CAAGGAACTT CGGGAGTTCC GTGCTGATGT CTATGGGTAT
GATCCACTCC TTTCAGAGGC CGACATCAAC CAGTTTGGTG TGAAGCCTCT TAAGGATCTG
AACGGGTTCA AGGCTGACTG TATCATCATC AATGTTCCGC ATGATGCCTT CAAGTCGCTG
ACCCTTGAAG GGGTTGAAAA GATGAGCAAC GGCACCCCGG TCGTCATCGA TGTGAAGGGA
ATGCGCAAGC AGTGGGCCGA TCCAGCCTGC AAGGTTTGCT ATACCCGGCT CTAA
 
Protein sequence
MVQKSVTDKT VCVVGLGYVG YPLAEAFSHH LKTIGFDIDA RKIGIINETP GNKVQATTDP 
SHIGEADIII IAVPTPVTKA KDPDLMPVIS AAETIGKHIK KGAIVVLEST VYPGVTEELM
APMIEKMSGF TCGKEFKIGY SPERINPGDE EHILPKITKI VSGMDEETLE TLASLYSLVT
TVYRAENIRT AEAAKVIENI QRDLNIALMN ELSLIFQKMG LDTQAVLEAA GTKWNFHHYR
PGLVGGHCIP VDPYYLVYKA EELGYHPQVI LAGRAINDFM PKHVAELAIK GMNDAGKVIR
DSKVLILGLT YKENVPDTRE SPSHEMIKEL REFRADVYGY DPLLSEADIN QFGVKPLKDL
NGFKADCIII NVPHDAFKSL TLEGVEKMSN GTPVVIDVKG MRKQWADPAC KVCYTRL