Gene Mpal_0101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0101 
Symbol 
ID7272271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp116789 
End bp120733 
Gene Length3945 bp 
Protein Length1314 aa 
Translation table11 
GC content56% 
IMG OID643568758 
Producthypothetical protein 
Protein accessionYP_002465217 
Protein GI219850785 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.470357 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCA ATAAGGATCG CCGGGGTTCA TCCGGTGAAG AACGGCAGCG ATGGCACGAG 
TGGGGTGTCT CTGAGGTGAT CGGTGTCGTC CTCCTCCTCT CCCTCGTGGT CGTCGGCGGC
ACCGTCGTCG GGACACAGCT CCTCTCTCAG CCAGCGCCCA AGGAGATCCC GAATGTGAAC
TTCCTGGCGA ATTACAACCC GGGAAACCTG ACCCTGTATC ATACCGGAGG GGATACCCTC
CCCAACGGGG ATTACAAGTT TGTCGTGCAG TACCTGGACG GTACAAAATC TCAAACATTC
AACGCCACTA AAGACTGGTC GTCCGGCAGT CCACTCACCT TCCCGGCCGT ATCACAGCCA
GCCAAGGTGA CCCTGGTCTA CACCGGCAAC GGAGCCGGCG AGACGGCACT CCGGTCGGTC
GTCATGGGAG ACCAGGGCAG CGATGTCGGG GAGCCTTCTT CGGGTTCAAA CTCAACAGAT
AACCCACCGT CAGCATCGAC ACATTACACC ATCACCCCGA TATTCGATTC ACTGAAGGGA
AGTATCACCT GCAACGGCAC TTCGTTAGTG AACAATAGTC CATTCGACCT TACCGCAGGA
GAAACGCCAA CGCTCACATT CACAAACATT GCAGGGTTCA AATTCGACAA GGCAATCATC
AGCGGGGACA AAGGCGCATC CACCACAGTG ACAACGAGCC CATACAGATT CACCACGGGT
GTAGACCAGA ACTACACTGT CACGGTAACA TTCGCAGAGA TAGCACCACC TGCGCCAGCA
CATTACACCA TCACCCCGAT ATTCGATTCA CTGAAGGGAA GTATCACCTG CAATGGCACT
TCGTTAGTGA ACAATAGTCC ATTCGATCTT ACCGCAGGAG AAACGCCAAC GCTCACATTC
ACAAACATCG CAGGGTTCAA ATTCGACAAG GCAATCATCA GCGGGGACAA AGGCGCATCC
ACTACAGTGA CAACGAGCCC ATACACATTC TCCACGGGTG TAGACCAGAA CTACACTGTC
ACGGTAACAT TCGCAGAGAT AGCACCACCT GCGCCAGCAC ATTACACCAT CACCCCGATA
TTCGATTCAC TGAAGGGAAG TATCACCTGC AATGGCACTT CGTTAGTGAA CAATAGTCCA
TTCGACCTTA CCGCAGGAGA AACGCCAACG CTCACATTCA CAAACATCGC AGGGTTCAAA
TTCGACAAGG CAATCATCAC CGGGGATAAA GGCGCATCCA CTACAGTGAC AACGAGCCCA
TACACATTCA CCACAGGTGT AGACCAGAAC TACACTGTCA CGGTAACATT CGCAGAGATA
GCACCACCTG CGCCAGCACA TTACACCATC ACCCCGATAT TCGATTCACT GAAGGGAAGT
ATCACCTGCA ATGGCACTTC GTTAGTGAAC AATAGTCCAT TCGATCTTAC CGCAGGAGAA
ACGCCAACGC TCACATTCAC AAACATCGCA GGGTTCAAAT TCGACAAGGC AATCATCAGC
GGGGACAAAG GCGCATCCAC TACAGTGACA ACGAGCCCAT ACACATTCTC CACGGGTGTA
GACCAGAACT ACACTGCCAC GGTAACATTC ACAGCCAATG TCTACACGGT GAACGTCACC
TCCGGCCTGA ACGGCTCGGT CCGGGCGAAC GACCAGACGG TCGCGGCCGG CACTTCCGGA
ACGGTCAACG TCGGTTACAA CGGAGAGGTG AACTTCACCT ACCTGCCAAC CACCGGCCAT
CACATCGACA CCGTGACCTA CACCGCCGAG GACGGCAGCC AGAAGCAGCT GCCGATCGTC
GACAACACCA CCTCCACACT CCCGGGTATC ACGAGCAACT GCACGGTCGT CGCGACCTTC
ACAGTCGACC GGTTCGCCAT CACCGTCACC CCACCAGTCC ACGGCAGCAT CACTCCGAAC
AGCACGCAGA TGGTCGACTA CAACGACACG CCAACCTTCA CCTTCACCCC TGAGGCGGGA
TACCACCTCG GCACGGTGAC GGTCGACGGC ACATCGGTCA CCCCGGCCGG GAACACCTAC
ACGTTCCCGC CAGTCACCGG GCCGCACACC CTGGCTGCGA CGTTTGTGGT CGACACGTTC
ACCATCAACG CATCGGCAGG GGACCATGGG ACGATCAACC CGAACGGCGC AGTTCCGGCC
AATTACGGTG AATCGAAGGA CTTCACCATC ACCGCGAACA CCGGATACCA TATCAACAGT
GTGACCGTCG ACAGCACGCC CCAGACCATT CCGGCAGGCA ACACCTCGTA CCTCTACACC
TTCACCAACA TCCAGGCGAG TCACACCATC AACGCCAGTT TCGCGATCAA CACCTACACG
ATCAACGCGA CCGCAGGGGA TCATGGAACG ATCACGCCGA ACGGCACGCA GACGGCCAAC
TACGGAGATT CCAGAACCTT CACTATCACG CCGGCGACCG GGTATACGAT CGCTGATGTG
ACGGTCGACG GGGTCTCACA GGGAGCGGTT ACGACCTACG CACTCACCAA CATCCAGGGC
GATCACACCA TCACGGCCAC CTTCACGCAG GTCATATACA CGATCACCGC GAACACCGGA
CCGCTCGGGA CGATCACCCC GTCCGGGTCA CTGGGGTATG CCTACCATGC GATCCCGTCG
TTCACCGTAA GCGCCATCCC TGGCTACCAT ATCGTCAACA TCATCGTCGA CGGAGCGATG
CAGGGGCCGG CCCCGAACTA TACGTTCGCC CCGCTCGAAG CGAACCACAC CATCACGGCT
GACTTCGCAA AGAATGTCTA CACGATCACC GCAACGGCCG GCAGCGGTGG CAGCATAACA
CCAGCCGGGA TCACCTCGGT GAACTACGGA GACAATCAGG TCTACACGAT CGCGGCCGGC
AGCGGCAACA CCATCGATGA TGTGGTCGTC GACGGCATCT CGATCGGCCA GGCCAGTTCA
TTCAACTTCA CGCAGGTGAA CGGGAACCAC ACGATCCAGG CCTACTTCGA CATCAACGGG
GTCGGACCCT ACTCGATCGC CGGGTATATC TGGAACGACC AGAACAATAA CGGAGTCTGG
GACAGTGGCG AACCGCCACT GGCCGGCTGG ACCGTTCAGG CCCAGGGCAA GGGGACGGCC
CAGGACTGGC ACGTGGTGAA CCAGACGGTC TCCGACAGCA CCGGTTACTA TATCATCAGC
AATCTCCCCA AGGACCACTA CCACGTGGTC GAGGTGCAAC AATCCGGCTG GAACCAGACC
TACCCGATCA GTCCGAAATA CTACGATGTC GAGAACCTGA ACCAGGGACG GGCGGACAAA
CACCAGCAGG TGACCGGGGA CAACTTCGGA AACCACATGA CCGCCACCGC AGGAAACCAG
ATCCTGTTGA ACAACCCGAT GAACATCGGG GTGCTGAAGG ACGGGACCTA CATCCGGTTC
ACTGGAACCA CCTGGCGCGA ATGGCCACAG CCCGCCCATA CCGATGACTG GATTGAAATG
GATTCCTCAG CCACCCTCGC ATCCGGACTG GATATCGTTA GTGGAAATAA GTTCAGGATC
CCCTGGCAGA CTCAGGTTAC GATCATGATG GACGGGGATC AGGCAAACGG GCAGATCTAC
ATCTCCAACA CTCAGATCAG CACCTACCAG TTCGACCATG TCAAGGTGTA CTTCAACGGG
AACCTGGTCG CCACAGGTAA GATCACGGGT ATCTGGGTGA ATGGATATAC CAACTGGCAG
TCCACGCTCA CCTACGTGAT GAGTTCCTAC AACTCGGGGG CCCAGTTCGT GGTGAACGGG
ACGAATATCA TCAACCCGCC CTGGTGGCCA TACCAGGACT GGGGTGTCAA TGTGTATGCG
ATTGCTCCAC AGAGTGCCCC CTATAGCAAC GTGCTGAACT TCAACTACCA GAATGGGAGA
ACCTACCTGG TCTGCAGTGG AACCTATGAT CTGTTCACTG CCTGA
 
Protein sequence
MKTNKDRRGS SGEERQRWHE WGVSEVIGVV LLLSLVVVGG TVVGTQLLSQ PAPKEIPNVN 
FLANYNPGNL TLYHTGGDTL PNGDYKFVVQ YLDGTKSQTF NATKDWSSGS PLTFPAVSQP
AKVTLVYTGN GAGETALRSV VMGDQGSDVG EPSSGSNSTD NPPSASTHYT ITPIFDSLKG
SITCNGTSLV NNSPFDLTAG ETPTLTFTNI AGFKFDKAII SGDKGASTTV TTSPYRFTTG
VDQNYTVTVT FAEIAPPAPA HYTITPIFDS LKGSITCNGT SLVNNSPFDL TAGETPTLTF
TNIAGFKFDK AIISGDKGAS TTVTTSPYTF STGVDQNYTV TVTFAEIAPP APAHYTITPI
FDSLKGSITC NGTSLVNNSP FDLTAGETPT LTFTNIAGFK FDKAIITGDK GASTTVTTSP
YTFTTGVDQN YTVTVTFAEI APPAPAHYTI TPIFDSLKGS ITCNGTSLVN NSPFDLTAGE
TPTLTFTNIA GFKFDKAIIS GDKGASTTVT TSPYTFSTGV DQNYTATVTF TANVYTVNVT
SGLNGSVRAN DQTVAAGTSG TVNVGYNGEV NFTYLPTTGH HIDTVTYTAE DGSQKQLPIV
DNTTSTLPGI TSNCTVVATF TVDRFAITVT PPVHGSITPN STQMVDYNDT PTFTFTPEAG
YHLGTVTVDG TSVTPAGNTY TFPPVTGPHT LAATFVVDTF TINASAGDHG TINPNGAVPA
NYGESKDFTI TANTGYHINS VTVDSTPQTI PAGNTSYLYT FTNIQASHTI NASFAINTYT
INATAGDHGT ITPNGTQTAN YGDSRTFTIT PATGYTIADV TVDGVSQGAV TTYALTNIQG
DHTITATFTQ VIYTITANTG PLGTITPSGS LGYAYHAIPS FTVSAIPGYH IVNIIVDGAM
QGPAPNYTFA PLEANHTITA DFAKNVYTIT ATAGSGGSIT PAGITSVNYG DNQVYTIAAG
SGNTIDDVVV DGISIGQASS FNFTQVNGNH TIQAYFDING VGPYSIAGYI WNDQNNNGVW
DSGEPPLAGW TVQAQGKGTA QDWHVVNQTV SDSTGYYIIS NLPKDHYHVV EVQQSGWNQT
YPISPKYYDV ENLNQGRADK HQQVTGDNFG NHMTATAGNQ ILLNNPMNIG VLKDGTYIRF
TGTTWREWPQ PAHTDDWIEM DSSATLASGL DIVSGNKFRI PWQTQVTIMM DGDQANGQIY
ISNTQISTYQ FDHVKVYFNG NLVATGKITG IWVNGYTNWQ STLTYVMSSY NSGAQFVVNG
TNIINPPWWP YQDWGVNVYA IAPQSAPYSN VLNFNYQNGR TYLVCSGTYD LFTA