Gene Mpal_0734 details


Gene Information

Locus tag: Mpal_0734
Symbol:
ID: 7270468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosphaerula palustris E1-9c
Kingdom: Archaea
Replicon accession: NC_011832
Strand:
Start bp: 730550
End bp: 732334
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 56%
IMG OID: 643569377
Product: extracellular solute-binding protein family 5
Protein accession: YP_002465821
Protein GI: 219851389
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
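
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; both are assumptions, not statements from the record, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length_bp = gene_length(730550, 732334)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.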


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.780369
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTGAAA GAAGAGGACC CAGTGCGTTC TTCGTCTGCT GTTCGATCCT GTTTCTGATC 
CTCATCTGCT GCATGCCAGT CTCCGCCGGG ATCCCGTGCG ATACGAATGG CGACGGGGTT
GTCTCTGAGG ACGAACTCTC GACAGGAATT CTGGATTATC TGAACGCACC CTCTGCGGCC
GGTCCCGCCA CGCTCGATAA GGCAGATCTC TCCCTCGCGG CTCACAACAC CCTGCACATC
CCGTACGGCC AACTCGTCGT AGCGGTCGAC AGCGAAAATG AACTCCTGCC GAGGGCATAT
ATCGATGAAA AAGGGGTTCC AGAAGATTCT CTCATCTATG AGGGGTTCGT TACCAGAGCG
CGTGACAAGG AAGATCTCGG CTGGCTGGCG TCTTCGTGGG AGCACTCTGC CGATGGAAAA
ACCTGGACGT TTCATATCCC GTCAGGGTCA ACCTGGCAGG ACGGCGTTCC CGTCACCGCG
AACGACGCTG TCTTCTCATA CCAGTATCTC AACGATAAGG GACTCAAGAA CAGGAACGTC
CTGTCGAATG TCACCAATGT GACGGCTCTC AACAACACGA CCGTGGTCTA TTCCCTGGAT
ATCTGCATGC CGCAGTTCCC CGACCTTCTT GCGACCGGAC CGGGCATTGC GATCTTCCCG
GAACATATCT GGTCGTCCAT CTCAAATCCA AATAAGGAAT CTGATCCTGA TTACATCGGT
TCCGGTCCTT TCATGTACAA TTCGAGCATC TCAGGAGATT CCTACAAACT TGAAGCATAT
CGGGGGTATC ATGGATCGGT ACCGTATGTC AACACGATTG TTCGAAAACT CTACTCCTCT
GAAGATACCA GGGTCCTTGC CCTGAAGAAC GGGGATGTCG ACTTCGTATC AGATCTGGCT
CCGGCCACCG CCCACTCGCT CCAGGGAGTG AACGGGATAG GAGTGACGAC AATTCCCGCC
GGAGGTAAGT CCTTCGAAGT CGCGTTCAAC CTCGACATCT ATCCGGCGAA CAACACGCTC
TTTCGGGAGG CTCTCAGCCA TGCTGTGAAC AGGGACAGGA TGTGTCAGCT GATCGACCGG
CAGGCAACCG AGGCGACCAG TACGGCATTT CTGATCCCCG CCCTCGCCGG CGACCAGGTG
AACAACGCCA CGAACGACAG GTACGGCTAC GACCTCACCG CAGCAAAAGC GCTGCTCGCC
CAGGCCGGCT TCACCCTAAA GACCGAGAAC GGGAAGAACA TCCTCTACGG ACCGGACAGC
CAGGTGGTCA CGATCACCAT CCCGCTCGGC GGCAAAGCCT CGGCCAACGG TGTCGACGAG
AAGATCGTCC AGGTGCTCAA GGAGGACTGG GAGACAGCGC TCGGCATCAC ACTCACCGTC
GAGAACCTCA AGGCCGATGA CGATCAGTAT GACAAACGGA TCGACGGAGA CGCAGTCCGT
ATCGACGGCA TGCCGAGTTA CTTCCACGAC GACATCGCCC GCCTGAATAA TTTCCAGTCG
AGTCCGCTTG GCAAGAATTA CTACCACTTC GACAACGGAT CCTTCAACGA ACTGATCGAC
ACCCTCCAGA ATACCGTGGA GGAGTCACAG AGAAAGGCGA TAGGAGACCA GCTTCAGGAG
ATCCTCGTCG AGCAGATCCC CTGCATACCG GTCTGTTCGA TGGACGCCTT CGATGCCTAC
CGTTCCGACC GGTTCTACGG TTTCGATTCC CTGATACATA AGGACGGGGG CGATATCAAC
ATCTTCTCAC ATGTCAAGCC TGCGAGTACG GAGGTGAACA GGTAG
 
Protein sequence
MSERRGPSAF FVCCSILFLI LICCMPVSAG IPCDTNGDGV VSEDELSTGI LDYLNAPSAA 
GPATLDKADL SLAAHNTLHI PYGQLVVAVD SENELLPRAY IDEKGVPEDS LIYEGFVTRA
RDKEDLGWLA SSWEHSADGK TWTFHIPSGS TWQDGVPVTA NDAVFSYQYL NDKGLKNRNV
LSNVTNVTAL NNTTVVYSLD ICMPQFPDLL ATGPGIAIFP EHIWSSISNP NKESDPDYIG
SGPFMYNSSI SGDSYKLEAY RGYHGSVPYV NTIVRKLYSS EDTRVLALKN GDVDFVSDLA
PATAHSLQGV NGIGVTTIPA GGKSFEVAFN LDIYPANNTL FREALSHAVN RDRMCQLIDR
QATEATSTAF LIPALAGDQV NNATNDRYGY DLTAAKALLA QAGFTLKTEN GKNILYGPDS
QVVTITIPLG GKASANGVDE KIVQVLKEDW ETALGITLTV ENLKADDDQY DKRIDGDAVR
IDGMPSYFHD DIARLNNFQS SPLGKNYYHF DNGSFNELID TLQNTVEESQ RKAIGDQLQE
ILVEQIPCIP VCSMDAFDAY RSDRFYGFDS LIHKDGGDIN IFSHVKPAST EVNR
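
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It builds a codon table from the standard 64-letter amino-acid string (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons), strips the whitespace that separates the 10-base blocks, and translates codon by codon. The helper names are hypothetical, and alternative start codons are not handled; this gene starts with ATG, so that does not matter here. A GC-fraction helper is included as well.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard amino-acid string.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence listed above.
fragment = "ATGAGTGAAA GAAGAGGACC CAGTGCGTTC"
print(translate(fragment))  # MSERRGPSAF
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the remaining residues and the GC content field to be checked the same way.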