Gene Mext_3735 details

Gene Information

Locus tag: Mext_3735
Symbol:
ID: 5833316
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 4137101
End bp: 4138132
Gene Length: 1032 bp
Protein Length: 343 aa
Translation table: 11
GC content: 69%
IMG OID: 641369525
Product: helix-turn-helix domain-containing protein
Protein accession: YP_001641180
Protein GI: 163853137
COG category: [K] Transcription
COG ID: [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain
TIGRFAM ID:
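
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention) and that the last codon of the CDS is a stop codon not counted in the protein length. The function names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4137101
END_BP = 4138132
GENE_LENGTH_BP = 1032
PROTEIN_LENGTH_AA = 343


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```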


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.43988
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGATCG CGGCCGGGGC CACATGCCCG GCAGGTGTTG CTGCGGCGAG CGAGGCCGTG 
ACCGGGAGAG CCATCTCCGT CGGTTTCATG CTGGTCGATC GCTTCTCGAT GATCGCCTTC
TCCTCGGCGA TCGAGCCCCT GCGGCTCGCC AACCGCGCGG TCGGCCGCGA TCTCTACCGC
TTCTCGCTGT GGTCGGAGGA CGGCACCAAG TGCACCGCCT CGAACGGGAT CGAGGTCAAG
GTCGCGGCCC GCTTCTGCGA TGCGCAGGAC TTCGACATGC TGGTCGTGTG CGGCGGCATC
GACATCCAGC ACCCGGATCA CCGCGCCCTC CATTCGGCCC TGCGCCGATG CAGCGCCCGC
GGGGCGGCGA TCGGCGCGGT CTGCACGGGC ACCTACGTCC TGGCCAAAGC CGGCCTCCTC
GACGGCTACC GGGCGACGAT CCACTGGGAG AACCTGCCGG GCCTCGTCTC GGAGCATGAC
GGGCTGGAGA TCGGCTCGGA CCTGTTCGAG ATCGACCGCA ACCGCTTTAC CTGCGCCGGC
GGCACCGCAG CGGCCGACAT GATGCTGTCG CTGATCGTGC GCGACCACGG GCCGAGCGTG
GCCTCGGAGG TCGCCGACCA GCTCATCCAC CACCGCATCC GCGAATCCGG CGAACGCCAG
CGGATGGACC TGCGCATGCG CCTCGGGGTC TCGCACCCGA AACTGCTGCG GGTGGTGGGG
CTGATGGAAA CCTCCCTCGC CGAGCCGCTC GGCAGCCAGG AACTCGCCGA CGCGGTGCAG
CTCTCGACCC GCCAGCTCGA ACGGCTGTTC CTGAAGTATC TCGGCCGCTC ACCGGCCAAG
CACTATCTGC GCATCCGTCT GGAACACGCC CGCAACCTGA TCCGCCAGAC AGCGATGCCC
CTACTTTCGG TCGCGTTCGA GTGCGGCTTC ACCTCGGCCT CGCACTTCTC GAAGGCCTAT
CTCGACTGCT TCGGCCAGCC GCCGAGCGCC GAGCGGAAGC TGGTGCAGAC GCAAGGCAGC
GTGCGCGCCT GA
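
The gene sequence is displayed in blocks of 10 bases, so the spacing has to be removed before the sequence can be measured. The sketch below shows one way to do that and to compute the GC fraction for comparison with the GC content field of this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first displayed line is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First displayed line of the gene sequence; paste all lines to analyse the full gene.
    first_line = "ATGACGATCG CGGCCGGGGC CACATGCCCG GCAGGTGTTG CTGCGGCGAG CGAGGCCGTG"
    seq = clean_sequence(first_line)
    print(len(seq), f"{gc_fraction(seq):.0%}")
```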
 
Protein sequence
MTIAAGATCP AGVAAASEAV TGRAISVGFM LVDRFSMIAF SSAIEPLRLA NRAVGRDLYR 
FSLWSEDGTK CTASNGIEVK VAARFCDAQD FDMLVVCGGI DIQHPDHRAL HSALRRCSAR
GAAIGAVCTG TYVLAKAGLL DGYRATIHWE NLPGLVSEHD GLEIGSDLFE IDRNRFTCAG
GTAAADMMLS LIVRDHGPSV ASEVADQLIH HRIRESGERQ RMDLRMRLGV SHPKLLRVVG
LMETSLAEPL GSQELADAVQ LSTRQLERLF LKYLGRSPAK HYLRIRLEHA RNLIRQTAMP
LLSVAFECGF TSASHFSKAY LDCFGQPPSA ERKLVQTQGS VRA
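
The protein sequence corresponds to a translation of the gene sequence with translation table 11. A minimal sketch of that translation follows. It assumes the codon-to-amino-acid assignments of table 11 match the standard code, which is my understanding of the NCBI definition, and it does not handle alternative start codons (this gene begins with ATG, so that does not matter here). The `translate` function is illustrative only.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace from the block-of-10 display is removed first.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence shown in this record.
    print(translate("ATGACGATCG CGGCCGGGGC CACATGCCCG"))  # prints MTIAAGATCP
```

The same function applied to the last 12 bases of the gene (`GTGCGCGCCT GA`) yields `VRA`, the final residues of the protein sequence, with the terminal TGA read as a stop codon.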