Gene Mpal_2191 details


Gene Information

Locus tag: Mpal_2191
Symbol:
ID: 7270276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2330830
End bp: 2331927
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 50%
IMG OID: 643570805
Product: CDP-glucose 4,6-dehydratase
Protein accession: YP_002467210
Protein GI: 219852778
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID: [TIGR02622] CDP-glucose 4,6-dehydratase
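
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene ends with a single stop codon, neither of which is stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information record above.
START_BP = 2330830
END_BP = 2331927
GENE_LENGTH_BP = 1098
PROTEIN_LENGTH_AA = 365


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_length = span_length(START_BP, END_BP)
computed_protein = coding_codons(computed_length)
print(computed_length, computed_protein)  # 1098 365
```

Under those assumptions both computed values agree with the listed Gene Length and Protein Length.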


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00831743
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCATCT CAGATAAACT GGCGGAAACA TACCGTGGAC GTACGGTTCT CATAACCGGT 
CATACCGGAT TCAAAGGTGG CTGGCTTGCT TTGTGGCTTG AATCCCTGGG GGCACACGTG
ATCGGATACA GCCTCGACCC GCCAACAGAT CCTTCATTTT TTCAAGAGAC AGAATTATCC
CGCCGTATTA CTGATATTCG AGGTGATATC CTGGATCAAA CAAAACTGGA TCGGGTAATC
AATGAATACC GCCCGGATTT TGTGTTCCAT CTCGCTGCCC AGCCCCTAGT TCGTGCATCC
TATCAAAGTC CACGTGAAAC GTTCAATGTA AACGTCATGG GGACGGTTAA TGTTCTTGAA
TCCATCCGTG TCTCTCAACA TCCGACGGTC TGTGTCTGTA TCACCAGCGA CAAATGTTAT
GAGAACAAGG AATGGGATTA CGCCTACCGG GAGAACGATC CGATTGGCGG CCATGACCCC
TACAGCGCGA GCAAAGGTGC TGCCGAGATT GTAATCGCTT CGTACCGGAA GAGTTTCTTC
GAACCGGATG GATCACAGCC ACTGTGTGCT CTCTCGTCAG CACGAGCCGG AAATATCATC
GGTGGTGGGG ACTGGGCAGA TGATCGGATC GTCCCGGACT GTGTGCGATC GCTTGTGAAC
GGGGAGACGA TGCTGCTTCG TAACCCCACT GCAGTACGTC CCTGGCAATT CGTGCTGGAT
CCCCTGTTCG GCTATCTCCT CCTTGCGCAG AGAATGAAGG AGTATCCTGG GGAATATTCT
GGTGCATGGA ATTTTGGTCC ATATTATTCC AACAACGTGG ATGTCCAGAC GCTCACTGGA
AAGATCTTCC GGGAGTGGGG TATCGGGAGA TGGGAGAATA TGCCTCAACA GAATAATCTG
CATGAGGCGT GTTTCCTCAA ACTGGATATC GCCAAGTCGA TGACAAGATT GGGGTGGAAG
CCTGTTTATT CCATAGACGA TGCGATCCAT AAAACTATAG AGTGGTATAT GGCCGATTTC
TCTCGGGCTG AAGAGATGTA CAACTTTTCT CTTGACCAGA TTGCAATGTA CATGCACGAC
GCAGATAATG TGGAGTAG
 
Protein sequence
MSISDKLAET YRGRTVLITG HTGFKGGWLA LWLESLGAHV IGYSLDPPTD PSFFQETELS 
RRITDIRGDI LDQTKLDRVI NEYRPDFVFH LAAQPLVRAS YQSPRETFNV NVMGTVNVLE
SIRVSQHPTV CVCITSDKCY ENKEWDYAYR ENDPIGGHDP YSASKGAAEI VIASYRKSFF
EPDGSQPLCA LSSARAGNII GGGDWADDRI VPDCVRSLVN GETMLLRNPT AVRPWQFVLD
PLFGYLLLAQ RMKEYPGEYS GAWNFGPYYS NNVDVQTLTG KIFREWGIGR WENMPQQNNL
HEACFLKLDI AKSMTRLGWK PVYSIDDAIH KTIEWYMADF SRAEEMYNFS LDQIAMYMHD
ADNVE
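
The gene and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not the method used to produce this record: it uses the standard codon assignments (which translation table 11 shares for elongation codons), assumes the start codon is ATG as it is here, and translates only the first and last lines of the gene sequence. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments, ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGCATCT CAGATAAACT GGCGGAAACA TACCGTGGAC GTACGGTTCT CATAACCGGT"
last_line = "GCAGATAATG TGGAGTAG"

print(translate(first_line))  # MSISDKLAETYRGRTVLITG
print(translate(last_line))   # ADNVE
```

The two results match the first 20 and last 5 residues of the listed protein sequence. The last line can be translated on its own because the 18 preceding lines hold 60 bases each, a multiple of three, so it starts in frame.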