Gene Mnod_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_1040 
Symbol 
ID7302584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp1112754 
End bp1114814 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content75% 
IMG OID643598789 
Producttranscriptional regulator, histidine utilization repressor, GntR family 
Protein accessionYP_002496351 
Protein GI220921050 
COG category[F] Nucleotide transport and metabolism
[K] Transcription
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases
[COG2188] Transcriptional regulators 
TIGRFAM ID[TIGR02018] histidine utilization repressor, proteobacterial
[TIGR02022] formiminoglutamate deiminase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGGC TCTGGTTCCG CTCCGCCCTT CTGCCCGACG GCTGGGCGGA GGGCGTCGCC 
CTCGACATCC GGGACGGACG GATCACGGCG GTGACCCGGG GCGTGCCCGC CGCCCCCGGT
GACGAGACCG GGGTAATCGG GCTTCCGGGC CTGCCGAACC TGCACAGCCA CGCCTTCCAG
CGCGGCATGG CCGGCCTCGC CGAGCGGCGG GGTGCCGGCC TCGATTCCTT CTGGACCTGG
CGCGAGGTGA TGTACCGGTT CGTCGACCGC ATGCAGCCGG ACGACGTCGA GGCGGTGGCA
GCGCAGGCCT ATGTGGAGAT GCTGGAGGCG GGCTTCACGC GCGTCGGCGA GTTCCACTAC
CTGCACCACG ATCCGGCGGG CGCCCCCTAT GCCGATCCGG CCGAGATGGC CTCCCGCATC
GCGGCGGCGG CTGACGCTAC GGGCATCGGC CTCACCCTGC TCCCGGTCTA CTACGCCCAT
GGCGGCTTCG GCCCGCTGCC GCCGAGCGCC GGGCAGCGCC GCTTCGTGAC CGACCCCGAG
CGTTTCGCGG ACCTGATGGC GGCAAGCCGC CGGGCCGTGG CGGGCCTCCC CGACGCCGTG
GTCGGGGTCG CGCCCCACAG CCTGCGGGCC GCGACCCCGG CGGAGGTCGA AGCGCTCCTG
CCGCTCGCGG GAGGTGGGCC GGTCCACATC CATGTGGCCG AGCAGGTGCG GGAGGTGGAG
GAGTGCCTTG CCGCCACCGG CGCGCGCCCG GTCGACCTCC TCCTCGACCA TGCCCCCGTC
GATGGGCGTT GGTGCCTGAT CCATGCCACG CACTTAAGCG AGGCCGAGAC CCCGCCGGCT
CGCGCGGAGC GGCGCGGTCG CGGGGCTCTG CCCGATCACC GAGGCCAATC TCGGCGACGG
CCTGTTTCCG CTTGCCGCCT TCGCCCGCGA GGGCGGGCGG TTCGGCATCG GCAGCGACTC
GAACGTGCTG ATCTCGGCCG CGGAGGAGAT GCGGCTCCTC GAATACGGCC AGCGCCTCGC
GGGGCGCGCG CGCAACATCG CGACCTCGCC GCACTCCCTC TCGACCGGGC GGGCGCTCGT
CGAGGCCTGC CTCGCCGGGG GAGGGCAGGC GCTCGGGGTG GCCTGCGGGC TCGCGGCGGG
GTGCCTCGCC GACATCGTCG GCCTCGATCC CGACCATCCG GCCCTCGCGG AGCGCGCCGG
CGATGCGTGG CTCGACGGCT GGATCTTCGC CGCCCGCGAC GGCGCGGTGG AGAGCGTCTG
GCGGGCGGGG CGGCGGGTCG TGGCGCAGGG CCGGCACCTG AAGCGCGAGG CGGTGGCGGC
ACGCTTCCGG TCCGCCCTGG CAAGGCTCGC CGCGTGACCC GGGCGGCGAC GCTCAACGCG
CGCATCCGCG GCGACCTCGA AGGGCGGATC CTGTCCGGCG AATGGCCGCC GGGCCATCGC
ATCCCCTTCG AGTCGGAGCT GTCGGCCCTC TACGGCTGCT CGCGCATGAC CGTGAACAAG
GTGCTGGCCG GCCTCGCCGA GGCGGGGCTG ATCGAGCGCC GCCGCCGGGC CGGCTCGTTC
GTGGCCCGGC CCGCCCGGCA ATCGGCCGTG CTGCAGATCC CCGACATCCC GAGCGAGATC
GAGGCCCGGG GTGCCCGCTA TGCGCTCGAA CTCGTCGCCC GGCGCGAGCG TCCGGCGGGG
GCGGCGGAGC CGGCGGAATC CGGCTTCACG CCCGGCCAGC CTCTCCTCGA CCTCACCTGC
CGCCACCTTG CGGACGGGCG TCCCTTCGCC TGGGAGGAGC GCCTGATCAG CCTCGCGGCC
GTGCCGGCTG CGGCCGCGGC CGATTTCGCC CGGGTGCCGC CGGGCACCTG GCTCCTGCGC
CACGTGCCCT GGACGGAGGC CTCGCACCGC ATCACGGCGA TCAACGCCGC AGCGCGCCTC
GCTCGCGCCC TCGACCTGCC GCTCGGCGGC GCCTGCCTCG CGGTCGAGCG CCGCACGTGG
CGCGGCGGCG AGACCCTCAC CTATGTCCGG CAGATCTTCC GGGGCGACAC CTACAGCCTC
GGAGCGCGGT TCTCGCCCTG A
 
Protein sequence
MARLWFRSAL LPDGWAEGVA LDIRDGRITA VTRGVPAAPG DETGVIGLPG LPNLHSHAFQ 
RGMAGLAERR GAGLDSFWTW REVMYRFVDR MQPDDVEAVA AQAYVEMLEA GFTRVGEFHY
LHHDPAGAPY ADPAEMASRI AAAADATGIG LTLLPVYYAH GGFGPLPPSA GQRRFVTDPE
RFADLMAASR RAVAGLPDAV VGVAPHSLRA ATPAEVEALL PLAGGGPVHI HVAEQVREVE
ECLAATGARP VDLLLDHAPV DGRWCLIHAT HLSEAETPPA RAERRGRGAL PDHRGQSRRR
PVSACRLRPR GRAVRHRQRL ERADLGRGGD AAPRIRPAPR GARAQHRDLA ALPLDRAGAR
RGLPRRGRAG ARGGLRARGG VPRRHRRPRS RPSGPRGARR RCVARRLDLR RPRRRGGERL
AGGAAGRGAG PAPEARGGGG TLPVRPGKAR RVTRAATLNA RIRGDLEGRI LSGEWPPGHR
IPFESELSAL YGCSRMTVNK VLAGLAEAGL IERRRRAGSF VARPARQSAV LQIPDIPSEI
EARGARYALE LVARRERPAG AAEPAESGFT PGQPLLDLTC RHLADGRPFA WEERLISLAA
VPAAAAADFA RVPPGTWLLR HVPWTEASHR ITAINAAARL ARALDLPLGG ACLAVERRTW
RGGETLTYVR QIFRGDTYSL GARFSP