Gene Mpal_0021 details


Gene Information

Locus tag: Mpal_0021
Symbol:
ID: 7270133
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 20571
End bp: 21878
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 62%
IMG OID: 643568680
Product: protein of unknown function DUF21
Protein accession: YP_002465140
Protein GI: 219850708
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
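
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch and not part of the original record: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced coding sequence that ends in a single stop codon.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive start/end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # total codons minus the stop codon


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(20571, 21878)
print(length, protein_length_aa(length))
```

For this record the sketch gives 1308 bp and 435 aa, matching the listed Gene Length and Protein Length.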


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.099852
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGCAC TCATTGAGAT CGGTATCATC CTTCTGCTGA TCCTCTTCAA CGGCCTCTTC 
TCGATGGCAG AGTTTGCGAT CGTATCAGCC CGCAAGATCA GGCTCTCCCA GCTGGCTGCG
GATGGGGATA AGCGGGCTGC AGTGGCCCTG GAACTTGCCG AGGAGCCGAA CCGTCTCCTC
TCTGCCGTCC AGATCGGAAT CACTGTCATC AGTATCGTCT CCGGTGCCTA TGGTGGGGCA
GCCCTCTCCG GGTACGTGGC GGCACCCCTC AAGTCGATCC CGGAGGTGGC TCAGTACAGC
GACCTTCTGG CCCTGGTGTT GGTTGTCGCT GCCATCACCT ACTTGACCCT GGTCTTTGGG
GAACTGGTCC CAAAAAGGCT CGCCCTCACG AATCCGGAGC AATTTGCGGC GTCGGTCGCG
GTCCCGATGA AGTGGTTCGC CTGGGTGGGA TCTCCGCTCG TTTCACTCCT CTCGTACTCC
ACCGATCTCG TGCTGGCAAT GCTCGGGGCA AAGAATTCGT CAGGGTCCCC GGTGACTGAG
GAGGAGGTGA AGCTCCTGAT ACGGGAGGGG ACACAGGCGG GGGTCTTCCT GGAGGAGGAG
CAGGCGATGG TCAGCCGGAT TCTCCGTCTT TCAGACCGGC GGGTATCCGG GCTGATGACA
CCACGACCTG AGATCACGGC GATAGATCTC CGGAGTCCAG ATCTGGAGCA GATCGCCCTG
ATGCGGGCCA GCGGCCACTC GTACTTTCCG GTGATCGACG GTGATCTGGA TCGGATCCGG
GGGATGGTCT CGGTCCGGGA CCTCTGGGCC CGGATGCTCG ATGGTCAGGA GGCCACAGTC
AGGGGCGCCC TGAGCGAACC ACTTTATATC CCTGAGTCAG TGCCAGCGCT GAAGGTGCCG
GCCCTCTTCC GGGACGCCGG TCTTCATCTG GGTCTGGTCA CCGACGAATA TGGATCGGTG
CAGGGCCTGG TCACCCCGCA CGACATCCTG GAATCGATCG TCGGGGTCCT CCCCTCCCCA
GATCAGGAGG CCGAGCCTGA GATCGTTCAG CGGGATGATG GCTCCTGGCT GGTCGACGGG
ATGCTCCCGC TCGACCAGTT CCGCGATGTC GTGCCGCTTG AGGACCTGCC GCTCGAGGAG
AAGGGGTATT ACCATACGAT CGGCGGACTT GTGATGATGC ATCTTGAACG GAGGCCGCAA
ACCGGGGACC GGTTTACCCA TGGGGACTTG CAGTTCGAGG TGGTGGACAT GGATGGCAAC
CGGGTCGACA AGGTGCTGGT CACCCAGGTC GATGAGGCAG ATCAGTAG
 
Protein sequence
MAALIEIGII LLLILFNGLF SMAEFAIVSA RKIRLSQLAA DGDKRAAVAL ELAEEPNRLL 
SAVQIGITVI SIVSGAYGGA ALSGYVAAPL KSIPEVAQYS DLLALVLVVA AITYLTLVFG
ELVPKRLALT NPEQFAASVA VPMKWFAWVG SPLVSLLSYS TDLVLAMLGA KNSSGSPVTE
EEVKLLIREG TQAGVFLEEE QAMVSRILRL SDRRVSGLMT PRPEITAIDL RSPDLEQIAL
MRASGHSYFP VIDGDLDRIR GMVSVRDLWA RMLDGQEATV RGALSEPLYI PESVPALKVP
ALFRDAGLHL GLVTDEYGSV QGLVTPHDIL ESIVGVLPSP DQEAEPEIVQ RDDGSWLVDG
MLPLDQFRDV VPLEDLPLEE KGYYHTIGGL VMMHLERRPQ TGDRFTHGDL QFEVVDMDGN
RVDKVLVTQV DEADQ