Gene Mpal_2113 details

Gene Information

Locus tag: Mpal_2113
Symbol:
ID: 7271593
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 2243303
End bp: 2245597
Gene Length: 2295 bp
Protein Length: 764 aa
Translation table: 11
GC content: 56%
IMG OID: 643570727
Product: extracellular solute-binding protein family 3
Protein accession: YP_002467134
Protein GI: 219852702
COG category: [C] Energy production and conversion
              [E] Amino acid transport and metabolism
              [T] Signal transduction mechanisms
COG ID: [COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain
        [COG1301] Na+/H+-dicarboxylate symporters
TIGRFAM ID:
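
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against each other. The following is a minimal sketch of that arithmetic. It assumes that the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; neither convention is stated in this record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2243303
end_bp = 2245597

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2295 764
```

Under those assumptions the computed values match the listed Gene Length (2295 bp) and Protein Length (764 aa).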


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTGC GCAGCCTGAA CCTGACCATC TGGATATTGC TGGGTTTTAT CCTCGGAGCG 
CTCGCTGGTC TGTTCTTTGG GGATCTCTGC TCGGTTCTGA AACCATTCAG CGATGCATTC
ATCAAGATCT GGCAGATCAC CATCCTCCCT TCCGTGGTGC TCTCCCTTAT CGTCGGAATC
GGGAGCCTGA AGAGGGACAG TGCACGGGCG ATCGCCGTGA AAGCGGGATT GGTCCTGCTC
CTGTTCTGGG CTATCGGTGT CGGGATATTT TTTTCATTTC AATGGGCGTT CCCTCCTCGG
GAGGCTGCCT CATTCTTCAG CGCACAGGAC CTGACCGAGA CGGTCGGTTT CAACATCATC
GACCAGTTCA TACCCTCCAA CCCCTTCCAG TCATTGTCAG AGGGGATAAT CCCGGCCACT
GTGCTCTTCT GCCTGTTCCT GGGCTTCGCC CTGATGATGG ATGATGGGAG CGGACCTATC
CTGAGTTCGC TCAGGATCCT GCTGGGCGCG CTCACACGCA TGACGAGCAT CCTGTCCCGG
ACCTTTCCCA TCGGTGTCTT CGTCATCACC GCCAGCACGG CAGGAACCAT GACCTTTGAG
GGTTTTCTGG ATCTTCAGGT CTACCTGGTC TCCCTTGCCG CGGCCACTGT CCTGCTCGGC
CTCGTTGTCC TGCCCCTGCT GATCACCTGT TTCACCACCT TCCGCTATCG GGACATCCTC
GCCGCCTCGT CCAGAGCTGT GGTTCTTGCA TTCTCGACCG GCAGTGAGTT CATCACCCTG
CCGCTGATCA TCGAAGGTGT CAGGAAACTC TTCGAGGGGC CGTCGGGGAT TCCTGTAAGC
AGCGGTGCGG GTGGGAATGC GGACCGGCCT GATGACCTTC CCCAAGAGGA CTGGACTGGG
GAGATGCAGG AACAGGCCCC TGACTGGAGG GACGTCCGGT CGTACAGCGA GATCCTGGTA
CCGGTGGCTT ATACCTTTCC GCTGCTGGGC GGGATCACCC CCTTTCTCTT CATCCTCTTC
GTGGCCTGGC TTTATCAGAG CCCGTCGGAT CTAATGCAGC AGGTGCAACT CATTGCCGTG
GGGATTCCCA CCTTCTTCGG TTCATCCAAA CTCTCGGTGG TATCGCTTCT GAATCTGATG
CACCTCCCTG CAGACGCGTA CAACCTGTAT ATCAGCATGG GGATCTTGCG TCAGTGCTTT
GTGGCTGCCC TCTCCTGTAT GTCGATCTTC TCGTTCTCCA CCATCAGCAT CGCCCTCATC
ACGAACCGGG GCCGGATGCG GTGGCGAAGG ATGATCCCAT CAGTCGTGCT CATTCTCTTG
CTGACCTTTA TCATGATTGG AGGTCTGAAG ATGGGTTTCG CTTTGTTGCT GGCCAACACC
TATCATGGGA ACGATCAGAT CTCTCAGATG GATCTTCCTC TCGATTTGAA TGGGGCGAGG
GTGGATGTGG GTATAAATAC TACAGTCTAT TCGCGTGTTG AGGATGTTCC GCCCCTCGGT
CCCTATGACT TGGGTGGAGG AGGGGAGGAT GTCAAACAGA TCCGGGAGCG GGGTGTGCTT
CGGGTGGGGT ATAACAGCAA TAATATCCCC TTTGTCTTCT TCAACGGAAA GGGTGATCTG
GTGGGATATG ATGTACAGAT GGCCTATGAT CTGGCCCGGT TCGTCAATGT CACCAGGGTC
GAGTTCGTCC CTATCACTGG GGCGACCCTC GCTGATTCCC TCAACAGCGG CTACTGCGAC
ATCGTCATGT CCTCGGTTGC AGTCACACCG GAGAGGCTTG ATGAGATGAA GTTCACCGAC
TCGTACGTCA CTGTTCACAT GTCGTTCGTG GTGCCTGACG AACAAAAGGC GGAGTTTTCG
AAGTTGGAGA ACGTGAAGAA GATGAAGGGT CTCCGGGTCG CGGTCTTCAA CAACACGGCG
CTGGTGAAGG TGGCTGCACA GATGCTTCCT GGAGCGACGA TCGTTCCAAT CGACTCAGAG
GAGGAGTTCT TCGTGGAGAA GAAGGCGGAT GCCCTCATCA CCACGGCTGA GGAGGGGTAT
ACGATGACCA TGCAGTACCC GTTCTACGAC GTGGCCATCA TCGAGCCTAA CGACTCGTAC
CAGATGATGT ATGGGTATGC TGTGGCCCAG AATAGTAGTG ATACCTATCT GATGTCGCTC
AATTACTGGC TCAAGATGGA GAAGGACTAC GGGAACCTGA ACGAAAAGTA CAATTACTGG
ATCCTGGGGA AGGATGCCGG ATTGACCGAG CCACGCTGGT CTGTGGTGCG GAACGTGCTG
CACTGGGAGA CCTGA
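
The gene sequence is displayed in space-separated blocks of 10 bases, 60 bases per line. As a rough sketch, the layout can be stripped and a GC fraction computed as shown below. The helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any tool referenced by this record; the demo input is only the first and last display lines, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Demo fragment only: the first and last display lines of the gene sequence.
# Paste all lines of the sequence here to evaluate the complete gene.
raw = """ATGAGATTGC GCAGCCTGAA CCTGACCATC TGGATATTGC TGGGTTTTAT CCTCGGAGCG
CACTGGGAGA CCTGA"""

seq = clean_sequence(raw)
print(len(seq), round(gc_fraction(seq), 2))
```

Run on the complete sequence, the result can be compared with the GC content field of this record.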
 
Protein sequence
MRLRSLNLTI WILLGFILGA LAGLFFGDLC SVLKPFSDAF IKIWQITILP SVVLSLIVGI 
GSLKRDSARA IAVKAGLVLL LFWAIGVGIF FSFQWAFPPR EAASFFSAQD LTETVGFNII
DQFIPSNPFQ SLSEGIIPAT VLFCLFLGFA LMMDDGSGPI LSSLRILLGA LTRMTSILSR
TFPIGVFVIT ASTAGTMTFE GFLDLQVYLV SLAAATVLLG LVVLPLLITC FTTFRYRDIL
AASSRAVVLA FSTGSEFITL PLIIEGVRKL FEGPSGIPVS SGAGGNADRP DDLPQEDWTG
EMQEQAPDWR DVRSYSEILV PVAYTFPLLG GITPFLFILF VAWLYQSPSD LMQQVQLIAV
GIPTFFGSSK LSVVSLLNLM HLPADAYNLY ISMGILRQCF VAALSCMSIF SFSTISIALI
TNRGRMRWRR MIPSVVLILL LTFIMIGGLK MGFALLLANT YHGNDQISQM DLPLDLNGAR
VDVGINTTVY SRVEDVPPLG PYDLGGGGED VKQIRERGVL RVGYNSNNIP FVFFNGKGDL
VGYDVQMAYD LARFVNVTRV EFVPITGATL ADSLNSGYCD IVMSSVAVTP ERLDEMKFTD
SYVTVHMSFV VPDEQKAEFS KLENVKKMKG LRVAVFNNTA LVKVAAQMLP GATIVPIDSE
EEFFVEKKAD ALITTAEEGY TMTMQYPFYD VAIIEPNDSY QMMYGYAVAQ NSSDTYLMSL
NYWLKMEKDY GNLNEKYNYW ILGKDAGLTE PRWSVVRNVL HWET