Gene Mpal_0389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_0389 
Symbol 
ID7271415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp407178 
End bp408419 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content65% 
IMG OID643569035 
Productprotein of unknown function DUF58 
Protein accessionYP_002465487 
Protein GI219851055 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.004479 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGCGGCCGA CACGGTGGAC CGGGTCGGCT GCGGTCGTCG CCGCCGTGCT CGGCACAATC 
GGGCTCTTCT TCGATGCCCC GGCAGCTGCC GGTGCTTCGG TCGGGCTCGC CGCCCTCCTC
GCAGGGGGAG CAGTGCTCTT TCTCTATCGC ACGATCCGGT ATGCGGACAC CCTCGCTGTC
GAGCGGGTCA TTGGGACGGG GCTCGTCTGC CAGGGAACTC CGGTGGAGGT CGGGGTGCAG
GTGACTGGTG AGGCTGTGTC CGGGCTCGCG GTCCGGATCA TGGACCTCCC CCCGCGTTCG
GCGGTGCACG ATCCCAAAGA GACGGTTCTT AAGGGTGGAG AGGGCCGGTA CCGAATCCGG
CTGATGGCCC CTGGCGAGGT CTTCTTCCGT GGACTTCGGG TTGAGGTTGC GGATCGGTTC
TTCACAACAA CGCTCTTCTG CACAGCCCCG CGGTTTGCAG GAACGATGCT GACCGTGTAC
TCGCCCGATA GTTATCGTCA TGAAAAAGGG CCCGGATCAG GAGCCGGGGA ACTCGAGATC
GAGCGGAAAG GGGTGCTGCG TGGACAGGGG ATCCGTTCAT TCCGTCCATT TCGGTCAGGG
GACGATCCGG CCCTGATGGA TTGGAAACTC TCGGCCAAGC ACGGCCGGCT CTTCGTCCGC
GAGCCAAACA GCCAGGTTGG AGGTCCCCCT CTGCTGATCG TCGACCTCCC GGTCGTCGGA
GCAGAGGGAG GTGAAGATCT CCTCCTCGCC GCTGGTGAAG CGATCGAACG GGAGATCCGG
GAGTACGACC ACTGTACCCT CCTCGTGATT GCAGGTGGCG AGGTGATCGG GTTCTGGTAT
CATGAGCAGG ACCTGCCGGC ACTGGTTCGG AACCTCCGGT CCCGGCCGGC CGATTCCGTT
GCTCCCCTCT ACCGGGTGTA TGATCCCATC GTCCTCAAGC GGCGGCTTCG GGCAGCTGAA
CGGGGTGTCT CCGAACCTTC GCGACGATTC GCCGCCGTGC TTCGTGCCAC CCTCGGTTCA
TCTTCCGCCT TCGCCTTCGA GGAGGAGGTC GGGCGGATCC TGGCCAGTGT CGAACACCCT
GAGGTGATCG TCTACACGGC CGCGACCGAT GAGATCAGCC ATCTGAATCT GATCGCCGTG
GCTGCCCGCC GCCACGACCG CACCCTCAGG ATTCTGCTGA CGCGCCCAGG GCCGGGTCTT
CTGGCCAGGC TCAGTTCGTA TGCCCGGGTG GAGGTGCTCT GA
 
Protein sequence
MRPTRWTGSA AVVAAVLGTI GLFFDAPAAA GASVGLAALL AGGAVLFLYR TIRYADTLAV 
ERVIGTGLVC QGTPVEVGVQ VTGEAVSGLA VRIMDLPPRS AVHDPKETVL KGGEGRYRIR
LMAPGEVFFR GLRVEVADRF FTTTLFCTAP RFAGTMLTVY SPDSYRHEKG PGSGAGELEI
ERKGVLRGQG IRSFRPFRSG DDPALMDWKL SAKHGRLFVR EPNSQVGGPP LLIVDLPVVG
AEGGEDLLLA AGEAIEREIR EYDHCTLLVI AGGEVIGFWY HEQDLPALVR NLRSRPADSV
APLYRVYDPI VLKRRLRAAE RGVSEPSRRF AAVLRATLGS SSAFAFEEEV GRILASVEHP
EVIVYTAATD EISHLNLIAV AARRHDRTLR ILLTRPGPGL LARLSSYARV EVL