Gene Mpal_2089 details


Gene Information

Locus tag: Mpal_2089
Symbol:
ID: 7271566
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2215916
End bp: 2217325
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 63%
IMG OID: 643570700
Product: hypothetical protein
Protein accession: YP_002467110
Protein GI: 219852678
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0175] 3'-phosphoadenosine 5'-phosphosulfate sulfotransferase (PAPS reductase)/FAD synthetase and related enzymes
TIGRFAM ID: [TIGR00451] uncharacterized domain 2
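
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical and do not come from the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 2215916
end_bp = 2217325
gene_length_bp = 1410
protein_length_aa = 469

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, expected_protein_length)  # 1410 469
```

Under these assumptions the span equals the listed gene length, and the codon count minus one equals the listed protein length.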


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTTCAC TCTATCTTGG AAAGATCCAG CTCCACTGGT GTGATTCCTG CCATGTTCCG 
GTGCTCGGAG GACGGTGCAG GTGTGGCGCT GAGACGCGTT CCGTTCCGTT GACCCCCCCC
GGGGACATGC GCCCGGCGTT CGAATACGAT ATCGACCTGA TCAACCGGAT CTATACCGCT
CACTTCGGGA CACCCCTGAT CCCGGATGGG CACCTCGCCC TGCTGAACAA GGTCCCTGAC
AAGGACCGGA TGGAGGAGAT CGTGGTCGGG GGGGCCGTGG TCGGCCAGAT TCGGTACCTG
CCGGAGAAGG AGGAATGGGA ACCGATTCCC CGGCCGGCTG CCGCGGCGCT GCTTACGCCG
ACCGAACGGT TCGTGGTGAT CGATGACGGG GCGATCCCTT CGATCCGGGA TGAAGGAGCC
AGTGTACTGG CGCCGGGGCT CGCCTGGATC GCCGATTCGG TCGAGGCCGG CGATGAGGTC
TTCATCATGA CCAGGGACCA CCAGTGTGTC GGTGTCGGGC GTGCCCGGGT CGGTGCAGCC
GAGGCACGGA CCATGGAACG GGGAGCCATC GTGAGGACCA GAAGGAACAC CTCCGCTCCC
TGTATCCCCG GGGAGGCCAC CTGGGCGGAC GCGGTCAGGG CCAATCAGGC GATCATCGAC
AACTACGAGG CGATGGCGAT GGCCTTCGTC CGCGATGTCG CCGCGGCAAA CCCCATCCCG
GCGACGGTCT CGTTCTCTGG CGGAAAGGAC AGCCTGGTGA CGCTACTGAT CGTCCAGAAG
GCGCTTGGAA AGGTGCCGAT CCTCTTCTCC GATACCGGGC TGGAGTTCCC TGAGACCTAT
CAGAATCTCA AGGATGTGCA GGAGAAGTAT GACCTCGAGG TCGTCTCCTG TTCAGGTGAG
GCCGCGTTCT GGGAGACGCT CACCGAGCAG GGGCCGCCGG CCGTCGATGC ACGCTGGTGC
TGCAAGGTCT GCAAGCTGAC CCCGATCGGC GGGGCGATCC GGGAGCGCTG GGGGGAGTGC
CTCTCCTTCA TCGGACAGCG GAAGTATGAG TCGTTCAAGC GGATGAAGAG CGGGCGGGTC
TGGAGGAACC CGAATCTGCC GATCCAGCTC TCGGCGGCTC CGATCCAGCA CTGGACCGCC
CTGCATGTCT GGCTGTACCT CTTCGCCGAG GAAGCACCCT ACAATGCACT GTATCGGGCC
GGGTTCGACC GGGTCGGCTG TTATATGTGT CCGTCCAGCG ATCTCTCGGT GCTGCTCAGG
ATCGAGCAGG AGTATCCGGA CCTCTGGGAG CAGTGGAACC ATCGGATCGC AGCCTGGCAG
CAGGATAAGG GGTTGCCAGA GGACTGGTTC CGGTCGGGAT CGTGGCGGAA GAGGGCAGGT
GATTCTGGTG AAGAAGATAG TAGTTGTTGA
 
Protein sequence
MPSLYLGKIQ LHWCDSCHVP VLGGRCRCGA ETRSVPLTPP GDMRPAFEYD IDLINRIYTA 
HFGTPLIPDG HLALLNKVPD KDRMEEIVVG GAVVGQIRYL PEKEEWEPIP RPAAAALLTP
TERFVVIDDG AIPSIRDEGA SVLAPGLAWI ADSVEAGDEV FIMTRDHQCV GVGRARVGAA
EARTMERGAI VRTRRNTSAP CIPGEATWAD AVRANQAIID NYEAMAMAFV RDVAAANPIP
ATVSFSGGKD SLVTLLIVQK ALGKVPILFS DTGLEFPETY QNLKDVQEKY DLEVVSCSGE
AAFWETLTEQ GPPAVDARWC CKVCKLTPIG GAIRERWGEC LSFIGQRKYE SFKRMKSGRV
WRNPNLPIQL SAAPIQHWTA LHVWLYLFAE EAPYNALYRA GFDRVGCYMC PSSDLSVLLR
IEQEYPDLWE QWNHRIAAWQ QDKGLPEDWF RSGSWRKRAG DSGEEDSSC
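
The gene and protein sequences above can be related programmatically. The following is a minimal sketch, not the database's own method: it strips the spacing used in the listing, computes a GC fraction, and translates codons using the amino-acid assignments of NCBI translation table 11 (the table named in the Gene Information section). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the code assumes the sequence contains only A, C, G, and T.

```python
# Hypothetical helpers for illustration; none of these names come from the source page.
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def clean(seq):
    """Remove the spaces and line breaks used in the listing and upper-case the bases."""
    return "".join(seq.split()).upper()

def gc_fraction(seq):
    """Return the fraction of bases that are G or C (assumes a non-empty sequence)."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)

def translate(seq):
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence listed above.
first_row_start = "ATGCCTTCAC TCTATCTTGG AAAGATCCAG"
print(translate(first_row_start))  # MPSLYLGKIQ
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison with the listed protein sequence and GC content.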