Gene Mpal_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1040 
Symbol 
ID7271774 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1069002 
End bp1070654 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content56% 
IMG OID643569677 
ProductDEAD/DEAH box helicase domain protein 
Protein accessionYP_002466111 
Protein GI219851679 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAAGGA GAAGAGGTAC TGTGAGCGAT ACAATAGAGA AGATATCCAA TACTGATACG 
CCGGCCGAAC CTGTTACCAA TGCATTCGAG GCACTGGGTA TCTCAAAAGA GATCCAGAGA
GCCATCGTTG ATCTCGGGTT CGAAGAGCCG ACCCCCATCC AGCAGATGGC GATCCCCCTG
ATTCATCAGG GATTTGATGT GATCGGCCAG GCCCAGACTG GGACTGGTAA AACGGCGGCT
TTTGGAATAC CAACGCTAGA GAAGATTGAT CCCCTGGATA AACATGTCCA GGCTCTGATT
CTCAGCCCAA CCCGTGAGTT GACCATCCAG ATCGCCGAGG AACTGAGCAA ACTGGCCCGT
TACCGACGCG GGATCGCGAT CCTACCGATC TATGGCGGTC AGCCTATCGA GCGGCAGTTC
GATGCTCTCA GACGGGGCGT TCAGGTCGTG ATCGGAACCC CAGGCAGGGT CATGGACCAT
ATGCGTAGGG GAACGCTGGT CTTCGACCAC GTGAAGACCG TGGTCCTCGA CGAAGCGGAC
GAGATGCTGG ATATGGGCTT CCGGGATGAT ATCGAACTGA TCCTCAAGAC GACGCCGTCA
GACCGGCAGA CCACGCTCTT TTCAGCTACG ATGTCGCAGC CGATCCTGGA ACTGACCAAG
CGGTTTCAGA AGAGCCCGAA GATGGTCAAG GTCACCCACA AGGAACTGAC GGTCGCGGCA
GTTGAACAGA TCTACTACGA GGTTCGCGAA TCGCTGAAGC TCGAGGCGCT GGCCCGCCTG
CTCGATATTT ACAATCCGAA ACTGACCCTG ATCTTCTGCA ACACCAAGCG GCGGGTCGAT
GAACTGGTCG GACAGTTACA GGTCAGGGGA TATGCTGCGG AGGCTCTCCA TGGAGACCTA
AAGCAGTCAC AGCGCGACCG GGTGATGGGC AGGTTCAGAT CCGGTGGGAT CGATATCCTG
GTCGCGACCG ATGTCGCAGC CCGTGGGATC GATGTCGACG ATATCGAGGC GGTCTTCAAC
TACGATATTC CGCAGGACGA GGAGTATTAT GTGCACCGGA TCGGCAGGAC CGGACGGGCC
GGCAGGACCG GGCGTGCGTT CACCTTCGTC TCGGGTAAGG AGATCTGGAA AATCCGGGAT
ATCCAGCGGT ACACCAACAC CCGCGTGATC CAGGCCCAGG TGCCGACCCT CTCGGATGTC
GAGGAGATCC GGACCACACT CTTCATCGAC AAGGTGAAGA CGATCGTCGA TGCAGGTGGG
CTTGAAAAGT ACGTCTCGAT GATCGAGAAA CTGATGCGTG ACGACTACGC TTCGCTTGAT
ATCGCAGCAG CACTGCTGAA GATGCGGATG GAGCGTGATA CCAAGGAGGA GACCGCCGCC
GAGCCGGACT TCAAGAATAC CGGTGCCGAG GCTGGCATGG TCAGGTTCTT CCTCAATGTC
GGCAGAAACC ACAATGTCCG GGCGAAGGAT ATCCTCGGTG CGATCGCTGG CGAGACCGGG
ATTCCTGGAA AGTCTATCGG TGCGATCAAC ATCTTTGACA GTTACTCGTT CGTCGAGGTG
CCGCTTGAGC ACGCAAAGAC GGTCTACCAG ATCATGAACA AGAACCAGAT CAAAGGGAAT
ACGATCAACA TCGAACCCGC AAACCAGCGG TAA
 
Protein sequence
MIRRRGTVSD TIEKISNTDT PAEPVTNAFE ALGISKEIQR AIVDLGFEEP TPIQQMAIPL 
IHQGFDVIGQ AQTGTGKTAA FGIPTLEKID PLDKHVQALI LSPTRELTIQ IAEELSKLAR
YRRGIAILPI YGGQPIERQF DALRRGVQVV IGTPGRVMDH MRRGTLVFDH VKTVVLDEAD
EMLDMGFRDD IELILKTTPS DRQTTLFSAT MSQPILELTK RFQKSPKMVK VTHKELTVAA
VEQIYYEVRE SLKLEALARL LDIYNPKLTL IFCNTKRRVD ELVGQLQVRG YAAEALHGDL
KQSQRDRVMG RFRSGGIDIL VATDVAARGI DVDDIEAVFN YDIPQDEEYY VHRIGRTGRA
GRTGRAFTFV SGKEIWKIRD IQRYTNTRVI QAQVPTLSDV EEIRTTLFID KVKTIVDAGG
LEKYVSMIEK LMRDDYASLD IAAALLKMRM ERDTKEETAA EPDFKNTGAE AGMVRFFLNV
GRNHNVRAKD ILGAIAGETG IPGKSIGAIN IFDSYSFVEV PLEHAKTVYQ IMNKNQIKGN
TINIEPANQR