Gene Mpal_0039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0039 
Symbol 
ID7272208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp37653 
End bp38933 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content67% 
IMG OID643568698 
Productnucleic acid binding OB-fold tRNA/helicase-type 
Protein accessionYP_002465158 
Protein GI219850726 
COG category[L] Replication, recombination and repair 
COG ID[COG1599] Single-stranded DNA-binding replication protein A (RPA), large (70 kD) subunit and related ssDNA-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.858141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTTTC ACTATGCCCT GGTCGACGAT CTGATGACCG GGTCCGAATT CGAGGCGAAG 
GTCGAGGCGC AGATCGCGGC CTGCGGCGAC CTGATCGACG AGCAGACCGC GGCGATGCTG
GTGGTCCGGG ACGCTGGCCG TCAGCATCAG AAGATCCGGG ACCTGGCCGG CCGGTCCAGC
CTGGTCCTCT TCTTTGCGAG GCTCCTCTCA GTCACCCCTC CGCGCGCGTT CACCCGGAAG
GACGGGAGCG AAGGGATGGT CTCGACGATG ACCGTTGGGG ACGAGTCGGG CAGGATCGAG
GTGACCCTCT GGGACGAGCG GGCAGAGGCC GCGGCAGAAC TCGAGGTTGG CGAGGTCTTT
GAGATCATCG CGAAACGATC TGAGCGGCGG GCCGGCGAGT TCACCCTGAT GGCCATGCAG
AAGGCGGCCT GCGAGATCCG GTGCCAGGCG CCGCAGCCTG TGCAGGAGGG GGAGGATGCC
CCTCCAGAAC TGGTCGTCAG GCTGCTGGCT TTGGAGGAGC TGCGGACCTT CTCGCGGCGG
GACGGGACGA CCGGGGAGAT GGTCGGGGCC CTGATCGGTA CCGAGGCCGG GACGGCCCGG
CTGGTCTGCT GGTACCCCCC GCTCCTCGAC GGGTTCACGG CCGGGGAGGT GATCCGGCTG
GTCGGGGCGA CCCGAAACCA CCGCTCTGAA CGGCCGGAGT ACATGATCAG CGAGACCGGG
GAGATCACGG CCACCGATCA GGAGGTGACG GTGCCCCTGA CCCCGCTCTC GGCGCTGGTC
CCGGACCAGA CCTTCTCGGT TGCCGGGACG GTCCACGCGA TCCAGCCGGC CCGGGAGTTC
GTGACCAGGA ACGGATCCCG TTCCTGGGTG CGGAACCTGA TCATCGGGGA TAAGGAGACG
GAGGTGCCGG TGGTGCTCTG GGGGGACCAT GCCCTGCAGG GGATCAGTGT CGGGGAGGCT
CTCTCTCTCT ACCACCTTTC GGCCCGGACC GGTCGTTCTG GTTCCCTGGA ACTGAGTGCC
GGCTGGGGGA GCGTGATCAT CACCCCGGCA CCCCCGCCCG AGGAGGTGAT CTTCTCCGGG
ACGGTGATCG TGACCAGGGC GGGTACCTTC CTGGATACCG ACGACGGCCG GTACCTGCTG
ACCGGCGGGG CGCCGCATGG CCGAACGGTG GAGGTGACCG GGGTGTTACA GGGTTACCGG
CTGACCCCGG CCAGCACCCG GCTGCTGGTT CCTGACTATG CTGCCGTGAA AGAGCGGATC
AGACATCTTA TCGACGGGTA A
 
Protein sequence
MQFHYALVDD LMTGSEFEAK VEAQIAACGD LIDEQTAAML VVRDAGRQHQ KIRDLAGRSS 
LVLFFARLLS VTPPRAFTRK DGSEGMVSTM TVGDESGRIE VTLWDERAEA AAELEVGEVF
EIIAKRSERR AGEFTLMAMQ KAACEIRCQA PQPVQEGEDA PPELVVRLLA LEELRTFSRR
DGTTGEMVGA LIGTEAGTAR LVCWYPPLLD GFTAGEVIRL VGATRNHRSE RPEYMISETG
EITATDQEVT VPLTPLSALV PDQTFSVAGT VHAIQPAREF VTRNGSRSWV RNLIIGDKET
EVPVVLWGDH ALQGISVGEA LSLYHLSART GRSGSLELSA GWGSVIITPA PPPEEVIFSG
TVIVTRAGTF LDTDDGRYLL TGGAPHGRTV EVTGVLQGYR LTPASTRLLV PDYAAVKERI
RHLIDG