Gene Mpal_1916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1916 
Symbol 
ID7272733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp2031565 
End bp2033016 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content59% 
IMG OID643570530 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002466943 
Protein GI219852511 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.114513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGCAG GCGCCCTGCT CGTCGTTCTG ATGATCGGAG CCGGGTGCAC GACAGTAAAG 
AACGAAACCG GAGATAAACT GGTCTTCTAC ACCGATCAGT ATCCTCCCTT CAGTAGCCAG
GAGAACGGCA CGCCAACCGG GATCATGGTC GACATGCTGA ACTCGACGAT GACAGAGATG
GGCAGGGGGC CGGCCGACAT CAAGGTCACC TGCTGGACCA GTGCCTATCA GACTGTCCTC
TCCACCCAAA ATACGGTGCT CTTTTCGACC ACACGCACCT CTGATCGTGA GCAGCTCTTC
AAGTGGGCCG GCCCGGTCCT GACCGATAAG GTGGTGGCCT TCTCCTACCG CGAACGACCG
GTCGTGGTGA ACAGCACCGC CGATCTGAAA CGCTACCGGA TCGGAGCGGA GGAGAACGAC
GCAGTCATCG GGAACCTCCT TTCCCTCGGG GTTCCGAAGG AGCAGATCGT GACCGCTCCA
GACCCCCAGA CCATGATCAG GCAGGTCCAG AACGGCTCGA CCGACCTCTT CGCCTATGGA
GAGGAAGCCG GTAACTACTG GATCGCACAG TCCGGGACCA GTTCAGGCCT CTTCTCAACA
GTCGTCACCA TCAGGGAGGA CCCGGTCTAT TACGCGTTCA ACCGGAACAC CTCGGACCAG
ACCGTCCAGG CCTTCCAGCA GGCCCTCAAC CGGTCTATCC AGTCCGATCT CGACCGGGTT
CTCGATGCAA ATCTGCCCGA GCGTAGCCTC GCCCGGCTCA ATTATCTGAC TGAAGAGTCC
CGGCCGTACA ACTTCGTGGC GAACGGGACC GTGCAGGGGA TCTCGGTCGA TCTCCTCAAC
GCGACGCTCT CCCGGCTCGG TGTCCCGGCG AATGCCACAT CAGTCAGGAT CGTCCCCTGG
AATGAGGGGT ATACAGATAC ACTGACGAAG AACGATACGG TCCTCTTCGC GACCGCCCGA
AACCCTGAAC GTGAGAACCT CTTCCTGTGG GCAGGACCGA TCGGGCGGCA CGATTATGTT
CTCTTTGCGG ACAGGACCAG AAATATCTCG ATCTCGACCG ATGCCGACCT CGCCCGGTAC
CGGATCGGAG CCGTCACTGG TGACGTCGGA GTCAAGTACC TGGCCGACCA TGGTGTCCCA
AAAGATCGGC TGGTGCTCGA TGCCAATGCA ACAACAGGGG TTCAGCGACT CGCCTCCGGA
GAGATCGACC TCTTCGCCGA TTCCATGGAG CCCAACCAGA CGGAACTGAA CAGCACGGTC
GCGAATTCGG AACGGTTCCA GAATGTATAC ACCATCGGGG GGAGCGAACT CTACTATGCA
TTCAACCGGA ATGTCTCGCC AGAGCTGGTC AGGGCCTTCC AGCGGGGGCT CGATAGCGTG
AAGAACGAGA AGGATACGAG CGGGGTCTCG GACTACGAAC GGATCATGGA AAAGTACGCA
GGGGTCAGGT GA
 
Protein sequence
MVAGALLVVL MIGAGCTTVK NETGDKLVFY TDQYPPFSSQ ENGTPTGIMV DMLNSTMTEM 
GRGPADIKVT CWTSAYQTVL STQNTVLFST TRTSDREQLF KWAGPVLTDK VVAFSYRERP
VVVNSTADLK RYRIGAEEND AVIGNLLSLG VPKEQIVTAP DPQTMIRQVQ NGSTDLFAYG
EEAGNYWIAQ SGTSSGLFST VVTIREDPVY YAFNRNTSDQ TVQAFQQALN RSIQSDLDRV
LDANLPERSL ARLNYLTEES RPYNFVANGT VQGISVDLLN ATLSRLGVPA NATSVRIVPW
NEGYTDTLTK NDTVLFATAR NPERENLFLW AGPIGRHDYV LFADRTRNIS ISTDADLARY
RIGAVTGDVG VKYLADHGVP KDRLVLDANA TTGVQRLASG EIDLFADSME PNQTELNSTV
ANSERFQNVY TIGGSELYYA FNRNVSPELV RAFQRGLDSV KNEKDTSGVS DYERIMEKYA
GVR