Gene Mpal_0220 details


Gene Information

Locus tag: Mpal_0220
Symbol:
ID: 7270605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 252843
End bp: 254141
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 57%
IMG OID: 643568872
Product: putative PAS/PAC sensor protein
Protein accession: YP_002465329
Protein GI: 219850897
COG category: [T] Signal transduction mechanisms
COG ID: [COG2202] FOG: PAS/PAC domain
TIGRFAM ID: [TIGR00229] PAS domain S-box
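
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 252843
END_BP = 254141
GENE_LENGTH_BP = 1299
PROTEIN_LENGTH_AA = 432


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks print True for the values in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 254141 - 252843 + 1 gives 1299 bp, and 1299 / 3 - 1 gives 432 aa, which matches the table.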


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACCAC CTTCAGAGGA ACTGCGGAGG ATCAGGGAGA TCCTGAAGCA GATACCGCAG 
GGGATGAGTG TGACCGAGAT TGCACGCGCC CTCGGAAAGA ATAAACACTC TGTCGGGAGG
TACTTGGACA TTCTGCGAGT CTCCGGTCAT GTCGAGATGA GGAACTACGG GATGGCCAAA
GTCTTCACCC TCTCCCAGCG GATTCCACTC AGCGCCCTCC TCTCCTTTCC CTCTGAGATG
ATCATGGTCC TCGACCAGGA GCACCGGATC GGACAGATCA ACGACCAGTT CCTACAGTTC
CTGCAGATTG AGCGCGGAGA GGTAGTCGGA CGGCAACTGG AATACCTGCC AGTCCCAAAC
CCAGCGGTCC ATGACCTCGT CCTACAACTG CTCGCTGCAC TGAACGGGGA GGAAGTCGCG
GACGAACTTG AGATTCCCAC CGACCCGGCC AGAATATTTA AATTAAAAGC TGTTGCGACG
GTATTCGATG ATGGGACACA GGGGATGACT GTGATCCTTG AGGATATCAC CGCCCAGAAG
CAGGCAGAGC AGGCACTGAA ACAGAGCGAA GCGCTCTTTC GGGGGATGGC TGAGAACATT
CAGGATGGGC TGGTCATCAG CAGGGACCGG GAGATGGTCT ACGTCAACGA GCGGGCTGCC
GCAATCCTCG GTTATCCGCG TGACGAGATC TTTGCGATGA CTGCCCTTGA TGTGATTGCT
TCTGAAGAGC AAGAGCGGGT CAGGCCCCTG GTCGATGAAT ACAATCAGTC AGGCGGGGTG
CCCAAAGAGC TTCGATTCTG GATCGTCCAG AAGAGCGGAA AACGGCGGTA CCTCTCCGCC
CGCCTCTCCT CAATCGATCA TGAGGGGGAT CATATCGCAT ATATCGTGCT GACCGATATG
ACCGAATGGA AGGAGGCCGA AGATACACTG AAACGGCAGT ACCTGTTTGT CCACCACTTC
ATCGATGCCT TCCCTCGCCC GATCTACTGT CTGAATCCGG ACCGGCGGTT CCTCGAATGC
AACCAGGCCT TTGAAGAGAT GGTCGGGCGA TCCCGGGCCG TGATCATCGG AGCCAAGACG
GCTGACGTCT TCCCTGCAGA GGACCTAGCA GTGTACGAGC AGGGGGACGA CGACCTATTT
CTGGAACCGT CCACCAGCAC ATACGAGGCA ACCCTGCAGT TCCCCGATGG ATCCAGACGG
CAGATGACGA TCGAGAAGGC CACGCTCAGA TCCCCCGAGG AAGGGGCCTC CTTAACCCTG
ATCGGAAACC TGATCGAGCG CGGACGGCAG CAGCACTGA
 
Protein sequence
MKPPSEELRR IREILKQIPQ GMSVTEIARA LGKNKHSVGR YLDILRVSGH VEMRNYGMAK 
VFTLSQRIPL SALLSFPSEM IMVLDQEHRI GQINDQFLQF LQIERGEVVG RQLEYLPVPN
PAVHDLVLQL LAALNGEEVA DELEIPTDPA RIFKLKAVAT VFDDGTQGMT VILEDITAQK
QAEQALKQSE ALFRGMAENI QDGLVISRDR EMVYVNERAA AILGYPRDEI FAMTALDVIA
SEEQERVRPL VDEYNQSGGV PKELRFWIVQ KSGKRRYLSA RLSSIDHEGD HIAYIVLTDM
TEWKEAEDTL KRQYLFVHHF IDAFPRPIYC LNPDRRFLEC NQAFEEMVGR SRAVIIGAKT
ADVFPAEDLA VYEQGDDDLF LEPSTSTYEA TLQFPDGSRR QMTIEKATLR SPEEGASLTL
IGNLIERGRQ QH
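
The two sequence blocks can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the database's own method. It assumes the sequences are pasted in as shown (blocks of 10 separated by whitespace), and it uses the standard codon assignments, which translation table 11 shares for elongation codons. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAACCAC CTTCAGAGGA ACTGCGGAGG ATCAGGGAGA TCCTGAAGCA GATACCGCAG"
print(translate(first_line))  # MKPPSEELRRIREILKQIPQ
```

The printed peptide corresponds to the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein length fields to be checked in the same way.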