Gene Mpal_0505 details

Gene Information

Locus tag: Mpal_0505
Symbol:
ID: 7271921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 509749
End bp: 511392
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 64%
IMG OID: 643569152
Product: dihydroxy-acid dehydratase
Protein accession: YP_002465601
Protein GI: 219851169
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
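
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 509749
END_BP = 511392
GENE_LENGTH_BP = 1644
PROTEIN_LENGTH_AA = 547


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 511392 - 509749 + 1 gives 1644 bp, and 1644 / 3 - 1 gives 547 residues, matching the listed gene and protein lengths.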


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGAAGTG ACGAGGTCAA AGCCGGATAT CAGCGGGCAC CCAACCGCTC GCTGCTTCGA 
GCGCTGGGGG TGACCGACGA AGAGATGAAC CTGCCATTCA TCGGGATCGC GAACGCGTGG
AACACGATCG TGCCGGGCCA CCTGCACCTT CGGACCCTGG CCGAGAAGGT CAAGGGAGGG
ATCTGCGCAG CCGGAGGGGT GCCCTTCGAG TTCGGGGTGA TCGGGATCTG TGACGGGATC
GCGATGGGTC ATTCCGGGAT GCGGTACTCG CTCCCCTCGC GGGAGACGGT CGCCGACTCG
ATCGAACTGA TGGCCGAGGC GCACCGACTC GACGGGCTGG TCTGCATCGG CACCTGTGAC
AAGATCGTCC CAGGGATGCT GATGGCGGCG GCACGGTGCA ACATCCCGAC GATCGTGCTA
ACCGGCGGAC CTATGCTCTC CGGGTGCAGC AATGGCAAAG ACCTCTCCCT GACCGATGTC
TTCGAAGGGG TTGGGAAGGT CGCGGCAGGC ACGATCACTG AAGAGGAACT GCACACTCTC
GAATGCACAG CGATGCCCGG ATGCGGCTCT TGCCAGGGGC TCTACACCGC CAACACGATG
GCCTGCATCA CCGAGTCGCT CGGGATGTCG CTCCCCGGGT GTGCAGCCAT CCCAGCGGTG
GACGCAGCAA AACTCAGGAT TGCCCGCAAA ACCGGTGAAC GGGTGGTCGG GCTCGTGAAG
GAGCAGGTGA CGCCGCGGGC GATCATCACA GCTCCAGCGA TCAGAAACGC GATCAGGGTC
GACATGGCCC TCGGCGGTTC GACCAACACC GTCCTCCACC TGATGGCCAT CGCAGAAGAA
GCCGGGCTTC CCTTTGATAT CGATCAGTTC ACTGCTATTG CAGAACAGGT CCCCCATATC
GGTGCCATGC AGCCCGGCGG GCCATACTCG ATGCAACAGC TTCACCATGC CGGCGGGATC
CCTGCCGTCG AGCAGCGACT GATCAGTCTG CTCGAAGACG GACCCACCGT ATCCGGGCAG
AACGTGCTCC AGATCGCGGC CAGGGGCGTC GTCACGGACG GGAAGGTGAT CGGAACGATT
GAGCACCCGG TACATGCCGC CGGCGGGCTA AAGATCCTTC GCGGGAGCCT CGCCCCAGAC
TCTGCTGTCG TGAAGTGTTC AGCCGTCAAC GAGGATATGT GGACCCACCA GGGACCGGCC
AGGGTCTTCG ACGGTGAGCA GGCCGCCATG GATGCGATCC TCAGCCGGCA GGTGCAGGAG
GGAGACGTGA TCGTGATTCG GTATGAGGGG CCGAAGGGCG GGCCCGGGAT GCCGGAGATG
CTCTCGCCGA CCTCCGCCCT GATGGGGCTC GGGTACACCC GCGTCGCTCT GGTCACGGAC
GGACGGTTCT CCGGCGGTAC CCGTGGTCCC TGCATTGGGC ACGTTGCCCC TGAAGCAGCG
ATCGGCGGGC CGATCGCACT GGTGAAAGAC GGAGATCTGA TCAGGATCGA TCTCAATGCA
AGGTCCATAG ACCTGCTGGT CGACGACGCC ACCCTTTCGG ATCGGCGGAA GTCATGGCAA
CCGTCAGAAC AGTTCCTCTC CGGGGTGCTG GCCAGGTATG CTGCCACAGT CGGACAGGCA
GATCACGGCG CTGTTCAGCG GTGA
 
Protein sequence
MRSDEVKAGY QRAPNRSLLR ALGVTDEEMN LPFIGIANAW NTIVPGHLHL RTLAEKVKGG 
ICAAGGVPFE FGVIGICDGI AMGHSGMRYS LPSRETVADS IELMAEAHRL DGLVCIGTCD
KIVPGMLMAA ARCNIPTIVL TGGPMLSGCS NGKDLSLTDV FEGVGKVAAG TITEEELHTL
ECTAMPGCGS CQGLYTANTM ACITESLGMS LPGCAAIPAV DAAKLRIARK TGERVVGLVK
EQVTPRAIIT APAIRNAIRV DMALGGSTNT VLHLMAIAEE AGLPFDIDQF TAIAEQVPHI
GAMQPGGPYS MQQLHHAGGI PAVEQRLISL LEDGPTVSGQ NVLQIAARGV VTDGKVIGTI
EHPVHAAGGL KILRGSLAPD SAVVKCSAVN EDMWTHQGPA RVFDGEQAAM DAILSRQVQE
GDVIVIRYEG PKGGPGMPEM LSPTSALMGL GYTRVALVTD GRFSGGTRGP CIGHVAPEAA
IGGPIALVKD GDLIRIDLNA RSIDLLVDDA TLSDRRKSWQ PSEQFLSGVL ARYAATVGQA
DHGAVQR
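
The sequences above are laid out in space-separated blocks of 10 letters, 60 per line, so they need to be joined before any downstream use. The following is a minimal sketch of that step plus a basic sanity check on the coding sequence; `join_blocks` and `ends_with_stop` are hypothetical helper names, and the stop codon set is the one used by translation table 11.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def join_blocks(lines):
    """Concatenate sequence lines, dropping the spaces between 10-letter blocks."""
    return "".join("".join(line.split()) for line in lines)


def ends_with_stop(dna):
    """True if the sequence length is a multiple of 3 and the last codon is a stop."""
    dna = dna.upper()
    return len(dna) >= 3 and len(dna) % 3 == 0 and dna[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Only the first two blocks and the final block of the gene sequence are
    # used here for brevity; pass all lines to process the full record.
    fragment = join_blocks(["ATGAGAAGTG ACGAGGTCAA ", "GTGA"])
    print(fragment)
    print(ends_with_stop(fragment))
```

Applied to the full gene sequence, the joined string should be 1644 letters long and the joined protein sequence 547 letters long, in line with the lengths listed under Gene Information.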