Gene Mext_0034 details

Gene Information

Locus tag: Mext_0034
Symbol:
ID: 5832931
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium extorquens PA1
Kingdom: Bacteria
Replicon accession: NC_010172
Strand:
Start bp: 36123
End bp: 37223
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 67%
IMG OID: 641365818
Product: helix-turn-helix domain-containing protein
Protein accession: YP_001637533
Protein GI: 163849490
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
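
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes a single terminal stop codon; the function name check_lengths is hypothetical and not part of the source database.

```python
# Values are copied from the record above; check_lengths is a hypothetical helper.
def check_lengths(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Return True if coordinates, gene length and protein length agree."""
    span = end_bp - start_bp + 1          # assumes 1-based, inclusive coordinates
    codons = gene_len_bp // 3
    return (
        span == gene_len_bp
        and gene_len_bp % 3 == 0          # a CDS should be a whole number of codons
        and codons - 1 == protein_len_aa  # assumes one terminal stop codon
    )

print(check_lengths(36123, 37223, 1101, 366))  # prints True
```

Under these assumptions, 37223 - 36123 + 1 = 1101 bp, and 1101 / 3 = 367 codons, which is 366 amino acids plus the stop codon.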


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.131953
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 40
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCATTCA CAGCCAACCT CACCGCGACG AAGCAGGAGG TGCGTTCGCG CGCGGCGGCG 
CCGTCCATGC CCCCGAGCGC CACGGGCGCG CTAGCCCGAC TCGCGGTGGC TCGCGCGGCT
GCGTCAGGGA TTGATGTCCA GCCGTGCCTG TCCAGGGCCG GTCTGAGCGA GCGCCAGATC
AAGAACCGGC ACGCTCGCAT CGGTGCGACA AACCAAGCCG CGCTCGTCGG TCTGCTGGCT
GAGGCGCTCG AGGACGACCT CTTCGGTTTT CATCTCGGCC AAAGCTTCGA GCTCGGTGAG
ATTGGGCTGC TTTATTACGT GATGGCCTCC GCCCCAACGC TGCGTGACGC ACTCTGTCGG
GCGGAGCGCT ACGCCGCGAT CACGAACGAG GGTATCGCCC CGATCTATAG CCAGAGCGGT
GAGGTCCGCG TCTCGTATGT CGGGCTGGCT CGGCACGCTG CACGGCATCA GGTCGAGTTC
TGGATGACGG GTCTCGTCCG GGTCGCCCAG CAATTGACCA GCCTGCGACT ATCGCCAATC
CACCTGACCC TGTGCCACCC ACGTCACGCG GGAGCCCGCG AGATCGAGGC CTTCCTCGGC
TGCGCCATTG CGTTCGATGC CCTGGTGGAC GAGGTTCAAT TTCCCTCTGC CGCGGGGAAC
GCGGTCCTGA CCGGCGCCGA CCCCTACCTG CACGATCTCC TGCTCGGGTA CAGCGAAGAA
GCGCTCGCTC ATCGTGTCCG TTTGGCGGAA AGCCTACGTA CGCGGGTGGA GAACGCGGTG
ATGCCGCTCC TGCCGCATGG TCGGCCGCGC ATCTCCGAGA TTGCACGGGC ACTGGGCACA
AGCCAGCGAA CCCTGTCCCG CCGCTTGACC GAAGAGGGGC TCAGCTTCGA GAGCGTGTTG
GAAGAGATGC GGCGGGACCT TGCCCTGCGC TACCTTCGGG ACACGCGTCT CTCGATCTCG
CGCATCGCTT GGTTGCTGGG GTTCCGGGAG GCCACCGCCT TCACCCACGC CTTCCGGCGC
TGGACGGGCC GATCGCCGAC GGAAGCGCGG GTAGAGCGGG ACCGGGCCCT GAACCCGTCG
CCTCCGCAGC GGTCCAGTTA A
 
Protein sequence
MPFTANLTAT KQEVRSRAAA PSMPPSATGA LARLAVARAA ASGIDVQPCL SRAGLSERQI 
KNRHARIGAT NQAALVGLLA EALEDDLFGF HLGQSFELGE IGLLYYVMAS APTLRDALCR
AERYAAITNE GIAPIYSQSG EVRVSYVGLA RHAARHQVEF WMTGLVRVAQ QLTSLRLSPI
HLTLCHPRHA GAREIEAFLG CAIAFDALVD EVQFPSAAGN AVLTGADPYL HDLLLGYSEE
ALAHRVRLAE SLRTRVENAV MPLLPHGRPR ISEIARALGT SQRTLSRRLT EEGLSFESVL
EEMRRDLALR YLRDTRLSIS RIAWLLGFRE ATAFTHAFRR WTGRSPTEAR VERDRALNPS
PPQRSS
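
The sequences above are printed in space-separated blocks of ten characters, so any computation on them needs the whitespace removed first. The following is a minimal sketch of how a GC percentage like the one reported in the record could be computed from such text; the function name gc_percent is hypothetical, and the method the database itself uses is not stated in the record.

```python
def gc_percent(seq):
    """Percentage of G and C among the bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()   # drop the spaces and line breaks between blocks
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

# First line of the gene sequence above, used only as sample input.
first_line = "ATGCCATTCA CAGCCAACCT CACCGCGACG AAGCAGGAGG TGCGTTCGCG CGCGGCGGCG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence as one string (line breaks included) works the same way, since all whitespace is stripped before counting.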