Gene Mpal_1681 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1681 
Symbol 
ID7271244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1739131 
End bp1740336 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content51% 
IMG OID643570298 
Productputative transcriptional regulator, GntR family 
Protein accessionYP_002466714 
Protein GI219852282 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTACA CTTTCGCCTC CAGAATGGGA AAGACCCCCC GGTCATTCAT CAGGGAGATC 
CTTAAAGTGA CAGAACGTCC CGAGGTCATT TCATTTGCTG GGGGACTCCC AAATCCGGCT
TTGTTCCCTG TTGAGAGGCT GGCTGAGTCT GCAAGGGCGG TTATCACGGG TGAGGGATCG
GCTGCTCTTC AATATGCCAC CACTGAAGGA TATCTTCCTC TGCGCGAATG GATCGCTGAA
CGGTACAAAA TTCGGCTTGG TATGGATGTA TCCCCTGATG AGATCCTGAT CACTAACGGT
TCACAGCAAT GTCTGGATCT TATCGGAAAG GTCTTCATTG ACCCGGGTTC CCGGGTTGCA
ATCGAACGAC CTGGCTATCT TGGGGCAATC CAGGCCTTTT CGCTCTATGA GCCAGAGTTC
ATGAATGTTC CACTTACTTC AGAGGGTCCT GATCCGACTG TCCTCAACTC TGTTCTTTCT
GAGGGGAATC CACGTATCTT CTATGGAGTC CCTAACTCGC AGAACCCTTC GGGAATCACC
TGGTCTCTTG AGAATAGGTG CAGAGTCGCC AAAATAATAC AGCGATCAGA GACAATCCTG
GTTGAGGATG ATGCATATGG TGAACTCAGG TTTTGCGGGG ATCAGATGCC TGCGATGCGG
TCCTTCCTGC CTGATAAAAC CGTGATGACC GGTTCGTTCT CAAAGATCAT CGCTCCGGGG
ATGCGGATGG GTTGGGTCTG TGCCCCGTAT GAGATTATGG AACAGATCGT CACCGCCAAG
CAGGGAACCG ATCTCCATTC AAACATCTTG AGTCAGCGGA TTATATCTCG GTTCCTGGCT
GATTACTCTA TTGACGAGCA TATCAGAGCG ATCACCGATG CTTATGCGCA CCAGCGGGAT
TGTATGCTTG CGGCTATAAA TGAGCATTTT CCCAAAGAAG TCACGTGTAC GAGTCCTGAC
GGGGGGATGT TCCTCTGGGC TACATTGCCG GATGGATTCT CCTCAACTGA ACTCTTCAAC
AGAGCATTGG CAGAGAATGT TGCCATCCTA CCTGGTGTTC CTTTCTATAC CGACGGAGGC
GGGGAGTCCA CTATGCGTTT GAACTTTTCA AATGCCAGTG ATGAGCGGAT CTGGGAAGGG
ATTGCTAGGC TCGGAAGTGT GCTTCATCAG TATATGCAAA CCGGCAGGAT CGCTGAAAAT
AACTGA
 
Protein sequence
MQYTFASRMG KTPRSFIREI LKVTERPEVI SFAGGLPNPA LFPVERLAES ARAVITGEGS 
AALQYATTEG YLPLREWIAE RYKIRLGMDV SPDEILITNG SQQCLDLIGK VFIDPGSRVA
IERPGYLGAI QAFSLYEPEF MNVPLTSEGP DPTVLNSVLS EGNPRIFYGV PNSQNPSGIT
WSLENRCRVA KIIQRSETIL VEDDAYGELR FCGDQMPAMR SFLPDKTVMT GSFSKIIAPG
MRMGWVCAPY EIMEQIVTAK QGTDLHSNIL SQRIISRFLA DYSIDEHIRA ITDAYAHQRD
CMLAAINEHF PKEVTCTSPD GGMFLWATLP DGFSSTELFN RALAENVAIL PGVPFYTDGG
GESTMRLNFS NASDERIWEG IARLGSVLHQ YMQTGRIAEN N