Gene Mpal_2239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2239 
Symbol 
ID7272536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2387955 
End bp2389976 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content56% 
IMG OID643570851 
Product4Fe-4S ferredoxin iron-sulfur binding domain protein 
Protein accessionYP_002467255 
Protein GI219852823 
COG category[C] Energy production and conversion 
COG ID[COG1148] Heterodisulfide reductase, subunit A and related polyferredoxins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.350238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0521387 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGAAG ACAAGAAAAA TCAAGAACCC AGAATTGGGG TGTTCCTGTG TCACTGTGGT 
ACAAATATTG CAGGTTCGCT CTCTATTGAT GATGTGCAGG CATATGCACA GACCCTCCCC
CACGTAGCCC ACGTAGATAA CTATCAGTAT ATGTGCTCGA CCCCGGGGCA GTCCAAGATC
AAAGATGCGA TCAAGGAATA CAATCTGACC GGAATTGTGG TCGCAGCCTG TACACCACGT
CTCCATGAGC CGACGTTCCG GATTGCCACT AAGGAGGGTG GACTGAACCC GTTTAGATTC
GAGATGGCGA ACATTCGGGA TCAGAACTCA TGGGTGCACA TGCACGACCA TGAGGGAGCC
ACCGAGAAGG CCAAGGACCA GGTCCGGATC GCGGTCGCCA AGGCAGCGCT CCTTGAGGAT
CTCTATCCAA AGTCGGTACC GGTGGAGAAG GCTGCGATGG TCGTTGGTGG AGGGGTTGCA
GGAATGCAGG CCGCACTCGA CCTCGCCGCA GCAGGGATCA AGACCTACCT GGTCGAGAAG
GAACCGTCGA TTGGTGGACG TATGTCCCAG CTCGACAAGA CCTTCCCAAC CCTGGACTGC
TCGCAGTGTA TCCTCTCCCC AAAGATGGTG GATGTTGGGA GAGCCGAGAA CATCGATCTG
TATACCTGTG CAGAGATCGA GGAAGTCGAA GGTTACATAG GTAACTTCGA CGTGACGATC
CGCCGGAAGG CCCGTGGTGT GCTCACACCC AAGGAGGCAG AGGCAAAGGG GATCGTCGGC
GGTGGCTGCA CCGGTTGCGG TGACTGCACC TCGGTTTGCC CAGTCGTGAA GCCGAACGTC
TTTGAGATGG GGATGGCTCC ACGGAAAGCG ATCTATATCA ACCACCCCCA GGTGGTTCCA
CTGATCTATA AGATCGACTT CGATTCGTGT GTGAAGTGCG GCCTCTGCGT AGAGGCATGT
GGCACCAAGC AGGCCGTCGA CCTTGAGATG CAGGACGAAC TGGTCAAGGT GAAGGTCGGA
ACCGTCATTC TCGCACTCGG GTATGAACTC TTCCCGATCG AGAAGAAGGA AGAGTGGGGA
TACAGGCGTT ACGACAACGT CATCACCGGT CTTGAGTTCG AGCGGCTGAT CTGTGCCTCA
GGTCCGACTG GCGGTCATCT GATCCGTCCG AGCGACGGCA AGACTCCGAT GAAGGTTGGA
TTCGTTCTCT GTGCGGGGTC ACGAGACAAC ACCGGCGTCG GCAAGCCATA CTGCTCCAGA
TTCTGCTGCA TGTATTCGCT CAAGCATGCC CACCAGATTA CCGAGAAGAT CCCGGGTGCA
ATTCCTTACA TATTCTACAT GGATATTCGA TCCTTTGGGA AGATGTATGA GGAGTTCTAC
TACCGTATCC AGAACGAGGG TGCCAAGTTC ATCAGAGGGC GTGTCGCCAA CATCCAGGAG
GATCCACTGA CCAAGAACCT GCATGTCTTT GCTGAGGATA CTCTCCTCGG TGAACCGATG
GATATGGAGG TCGACCTGGT CGTGCTCGCC TCGGCTGTTC AGCCGACCGA TATGACCGAA
AAGACCAGGA GACTCTTTGG GGTCTCCTGT TCGCAGGACG GATGGCTCCT TGAAGCACAT
CCGAAATTGA ACCCGTGCGG GACCACCACG GCAGGGGTCT TCCTTGCAGG GGTCTGCCAG
GGACCGAAGG ATATTCCTGA CACTGTAGCA TCTGCAGAAG GCGCTGCCTC TGCGGCATCG
ATTCCGATCC ACATGGGGCA GGTCGAGCTC GAGCCCTACT TCGCCATGTG CATCGAAGAG
AAGTGCGCAG GTTGCGGTAT GTGTGTGAAC CTCTGCCCCT ACTCGGCCCT CGCGCTCGTC
GAGAAGGACG GTCGGACTGT CATGCAGGTC ACAGAAGCCA AGTGCAAGGG ATGCGGGACC
TGTGGTGGAT TCTGCCCTGG TGGAGCAATC TGGATGAACC ATTTCACCTC TCCGCAGATC
CTTTCGCAGA TCGATTCGTT CCTTGTTGGA GGTGAGCAAT AA
 
Protein sequence
MVEDKKNQEP RIGVFLCHCG TNIAGSLSID DVQAYAQTLP HVAHVDNYQY MCSTPGQSKI 
KDAIKEYNLT GIVVAACTPR LHEPTFRIAT KEGGLNPFRF EMANIRDQNS WVHMHDHEGA
TEKAKDQVRI AVAKAALLED LYPKSVPVEK AAMVVGGGVA GMQAALDLAA AGIKTYLVEK
EPSIGGRMSQ LDKTFPTLDC SQCILSPKMV DVGRAENIDL YTCAEIEEVE GYIGNFDVTI
RRKARGVLTP KEAEAKGIVG GGCTGCGDCT SVCPVVKPNV FEMGMAPRKA IYINHPQVVP
LIYKIDFDSC VKCGLCVEAC GTKQAVDLEM QDELVKVKVG TVILALGYEL FPIEKKEEWG
YRRYDNVITG LEFERLICAS GPTGGHLIRP SDGKTPMKVG FVLCAGSRDN TGVGKPYCSR
FCCMYSLKHA HQITEKIPGA IPYIFYMDIR SFGKMYEEFY YRIQNEGAKF IRGRVANIQE
DPLTKNLHVF AEDTLLGEPM DMEVDLVVLA SAVQPTDMTE KTRRLFGVSC SQDGWLLEAH
PKLNPCGTTT AGVFLAGVCQ GPKDIPDTVA SAEGAASAAS IPIHMGQVEL EPYFAMCIEE
KCAGCGMCVN LCPYSALALV EKDGRTVMQV TEAKCKGCGT CGGFCPGGAI WMNHFTSPQI
LSQIDSFLVG GEQ