Gene Mpal_1957 details

Gene Information

Locus tag: Mpal_1957
Symbol:
ID: 7270761
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2070984
End bp: 2073029
Gene Length: 2046 bp
Protein Length: 681 aa
Translation table: 11
GC content: 61%
IMG OID: 643570570
Product: CHAD domain containing protein
Protein accession: YP_002466983
Protein GI: 219852551
COG category: [R] General function prediction only
COG ID: [COG0622] Predicted phosphoesterase
TIGRFAM ID: [TIGR00040] phosphoesterase, MJ0936 family


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0958558
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACCTA AACGCCGCCA GCCCCGGAGG AGCCGGCATG TCGAGATCCC CAGAGCAGAC 
GCAGGGTACT GTGTCTTTGG GGCCACCTAC CTGGCCGACC TGATCGTGGC CTTCGCCGCA
GAGGCCGGTG GGGTCAGAGC GGGTGTGGAT ATCGAGTACA TCCACCGAAT GCGGGTGGCC
ACTCGTCGTC TTCGAGCGGC CCTCCCCCTC TTTGTGGACT GTTATACAAA GACCGAATAC
CGTCGGTGGC TCTCGTCCAT CAAGGCGATC ACCCAGGCAC TCGGTGAGGC ACGGGATGCC
GACGTCCAGA TCGCGTTCCT CGATCAGTAC CTGCAGGGGA TCCGTGGGTC GGCTTCCGGT
TCATCAGCCA GCATGATCCG ACCCCTCAAT CAACAGCGGC CGCCGGATAA GCCGGAAGAA
CCCGAACCGA TAGCCCTTCC TCCGTCGGTT CCACTTCCTC AACCCGGGCT GCTCGGTGCT
CTCTGGCAGA TGCTCCGTCG GGTTTCGACA GCGATCACCG TCACCGGGGA GCCTGCTGTG
GAGAAAGCCC CTCCCCTCCC TCCACCAGAG GAGGAACGAT CATTTTCTGC TGGTCAACAG
GGGATCGAGT GCCTGTTGCT TCGTCTTCAG CAGCGGCGGG AGGCCCTGCA GCCGCAGGTG
ATCGAGGCGC TCGACCTGCT TGAAGAGCGC GGGGTGGTGA GGGAGATGCA GCGGCGGGTC
AGGATGATCG CCGTCACCGG AAAGCGGGAT CAGGTGGATA TCCACACCGC CGATGTCTAT
CAACGGGCAT ACAACGCGAT CCAGCTTCGG GTCCTGGAGA TCTTCGATCA TGAATCCTCC
GTGCCTCGTC CAGATCTTAT CACCGAACAT CATGAGATGC GAAAGGCTGC CAAACACCTG
CGGTACACGA TGGAGACCTT CGCCCTGCTC TATCCCGGAG GCCTGAAAGG GGAGTTGAAA
GCAGTCAAGC AGTTACAGGA ACTGCTCGGC GACATGCATG ACTGCGATGT CTGGATCGAA
TCCTTACCAC GCTTCCTCGT CGAGGAGAGG CAGCGGACCG AGACTTACTT CGGACATGCG
GAGTTCTTCT CGCTCATCGA GCCTGGCATC ACCCGTCTCC GTGAGGAGCG ACAGGGGAGA
CGGAATCTGG TGTACACCGA TTTCGTGACC TACTGGGACG AACTGAAGAA GGAACTCTTC
TGGGATAAAC TCCGGGATAC CCTGGGTACT GCGCTTGAGA GTGTCCAGTC CCCGCCCGCT
CCCCTGGCCC GTGCAGCAGA GCGGAAGGGG CCGGTCAGGA TCGCGTTGAT CGCTGATGTC
CATGCCAACC TTCCGGCGCT GGAGGTGGTC CTCTCCGATG CAGTCCAGCG GGGGGCTTCG
GTGGTGATCA ATGCCGGCGA TATGGTCGGT GGCGGTCCAT TCCCTGACGA GGTGGTCAGC
AGGCTCCGAA GGGCCGAATC GATCGATATC AAGGGAAATG CCGAGCGGAA GGTGCTCTCG
GTCCAGACTG GTAAGCACCC GAAGGGAAAG GGCCGGAATC TGGCCATCTG GACCTGGGAG
GCACTCTCTG CGGAGAACCG GACCTATCTC GCCGATCTCC CTGAAGAACT GCGGTTCATG
GTCAGGTCCA CTCGTCTGCT CGTGACCCAT GCCTCCCCGC TCGACCCCAG GGAACTGCTG
ACTGTATCGA CGTCTGCTGT CCGATTCAGT GAGCTCGGCC GGGCAGCCGG GGCGGATATC
GTGATCGTCG GCCACTCCCA CCAGCAGTTC TCGACGATCG TCGATGGGGT CCGATTTATC
AATCCCGGGA GTGTCGGGAC ACCGACCGGA AATGACAATC GGGCCAGTTA TGCCCTCCTG
CAACTCGAAC CCTACGACCT CTGCCACTTC CAGATCGCTT ATGACCGCGA GCCTGTAATC
AGGGCCAGCA TTGAACGTGG TCTGCCGGGA TCAGCGCCCA ACCAGCACCT CCTTGTTCAG
CCAGCTCTGG ATACGGTAAT CCCTGCAGAC CCTGGGAATC GCTCTCATGA GGAGCCAGAC
TCATGA
 
Protein sequence
MPPKRRQPRR SRHVEIPRAD AGYCVFGATY LADLIVAFAA EAGGVRAGVD IEYIHRMRVA 
TRRLRAALPL FVDCYTKTEY RRWLSSIKAI TQALGEARDA DVQIAFLDQY LQGIRGSASG
SSASMIRPLN QQRPPDKPEE PEPIALPPSV PLPQPGLLGA LWQMLRRVST AITVTGEPAV
EKAPPLPPPE EERSFSAGQQ GIECLLLRLQ QRREALQPQV IEALDLLEER GVVREMQRRV
RMIAVTGKRD QVDIHTADVY QRAYNAIQLR VLEIFDHESS VPRPDLITEH HEMRKAAKHL
RYTMETFALL YPGGLKGELK AVKQLQELLG DMHDCDVWIE SLPRFLVEER QRTETYFGHA
EFFSLIEPGI TRLREERQGR RNLVYTDFVT YWDELKKELF WDKLRDTLGT ALESVQSPPA
PLARAAERKG PVRIALIADV HANLPALEVV LSDAVQRGAS VVINAGDMVG GGPFPDEVVS
RLRRAESIDI KGNAERKVLS VQTGKHPKGK GRNLAIWTWE ALSAENRTYL ADLPEELRFM
VRSTRLLVTH ASPLDPRELL TVSTSAVRFS ELGRAAGADI VIVGHSHQQF STIVDGVRFI
NPGSVGTPTG NDNRASYALL QLEPYDLCHF QIAYDREPVI RASIERGLPG SAPNQHLLVQ
PALDTVIPAD PGNRSHEEPD S