Gene Mnod_1848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1848 
Symbol 
ID7305878 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp1949447 
End bp1950463 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content75% 
IMG OID643599583 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002497141 
Protein GI220921840 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.172735 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACCGG CGCAGGCTTT GGCGGAGACG CCAAACATCC CTCGCTTCAT GCCAAACGCA 
CGCCGCATCG AGATCCTGGC CTTTCCCGAT GTCCAGCTGC TCGACGTGGC CGGGCCGCTC
CAGGTCTTCT CCACCGCCAA CGACATCGCG GCCGCAGGAG GCGCGCCGCT CCCCTATGCC
CCCACCGTGG TCGCGGCCGA GGCCTCGGTG ACGAGCACGG CCGGCCTCGC GCTCGCCACC
GAGCCCCTGC CCCCGGCCGA CGCGCCCCTG CACACGCTGA TGGTGGCGGG CGGGCGCGGG
GTCGATGCGG TGAGCGAGGA TCCGGCGCTG CTCGCCTGGG TCCGGCGCCG AGCCGATGCG
GCGATCCGCA CGGCCTCCGT CTGCAGCGGT GCCTTCGTGC TCGCCGGGGC GGGGCTCCTC
GACGGCCGGC GCGCGGTCAC CCATTGGGGT CGCTGCGCCC AGTTCGCCGC GCGCTTTCCC
GCCGTGCGGC TCGATCCCGA TCCGATCTTC GTCCGGGACG GCAGCGTCTG GACCTCCGCG
GGGGTCACGG CGGGCATCGA CCTCGCCCTC GCCCTGGTGG AGGACGATCT CGGCCGCGCC
ACTGCCCTGG CGGTGGCCCG CCAGCTCGTC ATCTTCCTCA AGCGCCCCGG CGGGCAGGCG
CAGTTCAGCA CGCTCCTCGC CCTCCAGGAG GCGGGACGCT TCGACCGGCT CCATGCCTGG
ATCGCCGAGA ACCTGAGGGC CGACCTCTCG CTCGCGGCCC TGGCGGACCG GGCCGCCATG
AGCGCCCGCA GCTTCTCGCG CCATTACCGG CAGGCGACCG GGCGCACGCC CGCGCGGGCG
GTCGAGGAGA TCCGGGTCGA GGCGGCCCGC CGGCTGCTGG AGCAGGGCGC GCCCGTGGCG
CGGGCGGCGG CCCAGTGCGG GTTCGGATCG GAGGAGACCA TGCGGCGCGG CTTCCTGCGG
GTGATCGGCA CCGGGCCGCG GGCCTATCGC GAGCGCTTCT CGGGGCGCTC CGCGTGA
 
Protein sequence
MLPAQALAET PNIPRFMPNA RRIEILAFPD VQLLDVAGPL QVFSTANDIA AAGGAPLPYA 
PTVVAAEASV TSTAGLALAT EPLPPADAPL HTLMVAGGRG VDAVSEDPAL LAWVRRRADA
AIRTASVCSG AFVLAGAGLL DGRRAVTHWG RCAQFAARFP AVRLDPDPIF VRDGSVWTSA
GVTAGIDLAL ALVEDDLGRA TALAVARQLV IFLKRPGGQA QFSTLLALQE AGRFDRLHAW
IAENLRADLS LAALADRAAM SARSFSRHYR QATGRTPARA VEEIRVEAAR RLLEQGAPVA
RAAAQCGFGS EETMRRGFLR VIGTGPRAYR ERFSGRSA