Gene Mpal_2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_2041 
Symbol 
ID7272022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2161846 
End bp2162898 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content56% 
IMG OID643570653 
Producthypothetical protein 
Protein accessionYP_002467063 
Protein GI219852631 
COG category[R] General function prediction only 
COG ID[COG3943] Virulence protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGAGC CGGCCGGAGG TTTGTCCGAG TTTCTTCTCT ACCAGACCGA AGGGGGCGAG 
ACCCGTGTCC AGGTCAGCCT GTTCGAGGGG ACGGTCTGGC TGACGCAGCG GTTGATCGCC
GAGTTGTATC AGAAATCGAT CAAGACAATC AACGAGCATA TCAAAAACAT CTACGAGGAG
CGGGAACTCG ACCCCCAAGC AACTATCCGG AAATTCCGGA TAGTTCAACT GGAGGGTGAC
CGCCAGGTCG AGCGTCTGGT GGACTTCTAC AACCTCGATA TGATCCTTGC CGTGGGCTAC
CGGGTGCGGT CGCACAGGGG GACCCAATTT CGACAGTGGG CGACGAAGCA ACTGCGGGAA
TATGTGGTGA AGGGTTTTGT CCTGGACGAT GAGCGCCTGA AGGAGGCCGG CGGGATCGGG
AGGGACTACT TCGACGAACT TCTGGAGCGG ATCCGTGACA TCAGGGCGTC GGAGAGGCGG
TTCTACCAGA AGATTACTGA TATCTATGCG ACGAGCGTCG ACTATGATCC GAAAGACCCG
ATGACACTTG AGTTCTTCAA GACGGTACAG AACAAGATGC ACTGGGCGAT CCACGGGCAT
ACGGCGGCTG AGACGATATT CCTCCGGGCT GATGCGAGGA AGCCGCACAT GGGGCTGACG
ACCTGGAAAC AGGGGCCGAA AGGGCGGATC CACAAGACCG ATGTGGGGGT TGCGAAGAAC
TACCTGACCA GGGAAGAGAT CTCAAACCTG AACCTGATCG TGAACCAGTA TCTCGACTTC
GCTGAGTTTC AGGCCCGCCA GCGCCGGGAG ATGAGGATGG AGGACTGGAT CAGGAAACTG
GATGGTTTTA TTCAGTTGAA TGACCGGAAC GTTCTCAAAA ACGCCGGGAG TATTTCGGCA
GAAAGGGCGA AGCAGAAGGC GCAGAAGGAG TTTGAGGGGT CTGAAGCGCA GCGCCGTATC
AAGGAGGCGA GCGAGCCGAC AAGCGACTTC GACCTGATGG TCGACGAGGT TACGTATCTC
TCCAAGAAGC AGGAGGACGA AGATGAAGTC TGA
 
Protein sequence
MREPAGGLSE FLLYQTEGGE TRVQVSLFEG TVWLTQRLIA ELYQKSIKTI NEHIKNIYEE 
RELDPQATIR KFRIVQLEGD RQVERLVDFY NLDMILAVGY RVRSHRGTQF RQWATKQLRE
YVVKGFVLDD ERLKEAGGIG RDYFDELLER IRDIRASERR FYQKITDIYA TSVDYDPKDP
MTLEFFKTVQ NKMHWAIHGH TAAETIFLRA DARKPHMGLT TWKQGPKGRI HKTDVGVAKN
YLTREEISNL NLIVNQYLDF AEFQARQRRE MRMEDWIRKL DGFIQLNDRN VLKNAGSISA
ERAKQKAQKE FEGSEAQRRI KEASEPTSDF DLMVDEVTYL SKKQEDEDEV