Gene Mext_3095 details


Gene Information

Locus tag: Mext_3095
Symbol:
ID: 5835418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 3444221
End bp: 3445381
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 72%
IMG OID: 641368895
Product: regulatory protein LuxR
Protein accession: YP_001640554
Protein GI: 163852511
COG category: [K] Transcription
COG ID: [COG2771] DNA-binding HTH domain-containing proteins
TIGRFAM ID:
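
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 3444221
end_bp = 3445381
reported_gene_length = 1161      # bp
reported_protein_length = 386    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no amino acid.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions, both reported lengths are consistent with the coordinates.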


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATCAG GTCGACATCT CTCATGTCGG TCGATGCCCG GTTTCGATCC TCTCGCCTTG 
ATCGACGCGT GTTACGCGGC GGCCGTCGAA CCCGAGACGT GGTCTGACGC GCTCGACGGG
ATCGCCGATG CGGTCGGGGC CCGCGGTGCG GTGATCGTCT CGCACGACCC CGCGCGCACG
GCGCAGACGC TGGCCTCCGA GCGGATCGCC GACGTGAATC TCGATTACGG CGTCGGCGGC
TGGTGGCAGC ACGACACGCG CATCCGCCTC GGCCGGGAAC GCGGCATCAT GCGGCCGGGC
ATGGTCACGG TCGATGAGAT GTACCTGACG GCGGAGGAGA AGCGCGCCGA TCCGTTCTTC
CAGCACTTCA TCGACCGGCA CCGCCTCGCC AACCTCTGCG CCTATGTCGG GGCCGATCCG
ATGGACCGCC ACACCCTGTC GTTCAGCGCC TCACGCGATG TGCGCAACGG GCCGTTCGAG
GGCGCCGAGA TCGAGCGGAT GGGGCTCGTC GGCCCGCACG TCATCCGCGC CTTCCGGCTG
ACGGCGCTCG TCGGCGAGAT GCGGCGCGAG GCGGAGGGTC TGACCTCCGC CCTGGAGCGG
ATGCGCGCCG GCATCGTCGT CCTCGATGTC CGCGGCCGTG TCCGGCTTGT CAGCCCCGTG
GCGGAAAGGC TGGCGGCGGG CTACCTCGCC CTGCGCACCG GTGCCGAACC GGAGGCGGCG
GAGCCCATCG ACAGCGCACG GTTCGGCCGC TTCCTGGCGG ATGTCCTGCC GGAGCGCAGT
TGGCCGCGAC AGGAGACGAT CCTCCTGCGC CGCCGGGGCG GCGGGCGCCC GCTCTATGTC
GAGGCGCTCC CCTTGCGCGG CGCCGATCCG TTTCCGGTGG CCGGCGCCGG GCTCGGCGGC
GGCGTGGTCC TGCTCCTGCG CGACCTGCTC GCGCCGAGCA CGCGCCCGAT CGAGCCGTTG
CTGGAACAGC TCGGCCTCAC CCGGGCCGAG GCGCGGGTCG CGGCGCTGGT CGGCCGGGGC
GCGGCGCCGC GGGAGGTGGC CGAGCAACTC GCCGTCGGCG AGAGCACGGT CCGCAGCCAG
CTCAAGGCGG TCTACGGCAA GCTGGCGATC CGTCGGCAGA GCGAGCTGGC GGTCTTCATC
ACCCGTCTCG ACAGCCTCTG A
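
The gene sequence is listed in space-separated blocks of 10 bases, so the spacing has to be removed before any computation. The following is a minimal sketch of how a GC fraction could be computed from such a listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the listing is used as input here.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above; paste the full listing to
# compute the value for the whole gene.
first_line = "ATGGAATCAG GTCGACATCT CTCATGTCGG TCGATGCCCG GTTTCGATCC TCTCGCCTTG"
seq = clean_sequence(first_line)
print(len(seq), "bases, GC fraction:", round(gc_fraction(seq), 2))
```

A short excerpt will not necessarily give the same GC fraction as the whole gene.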
 
Protein sequence
MESGRHLSCR SMPGFDPLAL IDACYAAAVE PETWSDALDG IADAVGARGA VIVSHDPART 
AQTLASERIA DVNLDYGVGG WWQHDTRIRL GRERGIMRPG MVTVDEMYLT AEEKRADPFF
QHFIDRHRLA NLCAYVGADP MDRHTLSFSA SRDVRNGPFE GAEIERMGLV GPHVIRAFRL
TALVGEMRRE AEGLTSALER MRAGIVVLDV RGRVRLVSPV AERLAAGYLA LRTGAEPEAA
EPIDSARFGR FLADVLPERS WPRQETILLR RRGGGRPLYV EALPLRGADP FPVAGAGLGG
GVVLLLRDLL APSTRPIEPL LEQLGLTRAE ARVAALVGRG AAPREVAEQL AVGESTVRSQ
LKAVYGKLAI RRQSELAVFI TRLDSL
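
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first three blocks of the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons). The function names are hypothetical, and this is not necessarily the procedure used to produce the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def translate(seq: str) -> str:
    """Translate complete codons from the start of seq, stopping at a stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
head = clean_sequence("ATGGAATCAG GTCGACATCT CTCATGTCGG")
print(translate(head))  # MESGRHLSCR, the first 10 residues of the protein
```

Passing the full gene sequence to `translate` would allow a comparison against the whole protein listing.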