Gene Mnod_4868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMnod_4868 
Symbol 
ID7303808 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium nodulans ORS 2060 
KingdomBacteria 
Replicon accessionNC_011894 
Strand
Start bp4956343 
End bp4957440 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content65% 
IMG OID643602511 
Producttranscriptional regulator, AraC family 
Protein accessionYP_002500031 
Protein GI220924729 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.608166 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTCCT TTGTCTTGTC GGGTTGCAAT CGATCTCTGA GCGCCACGGC AGGCGCCGAA 
GAGCTGCGCG AGCTCGCGCA CGGACCCCCC ATTGATTTCT TGCACATCAA CGCAGTTCAG
GCGATCTGCA GCGTCCTCAT AGATTTTGGC ATCGATCCAA ACCGGCTGTT CGAGCAGGAC
GGAATCAGTA CGTTGTTCCT CGACGGCACC GAAGTGATCT CATTCGCGTC GCTCGGCCGC
CTGACGGCTC TTGGTGCCCA TTGCAGCCAA TGCCCCCACT TCGGACTTCT CGTCGGTCAG
CGCACCACCC TCGCTTCGCT CGGCCTCCTC GGGGTGCTGA TGCGAAACTC GGAGACGATC
GGCGATGCCC TGCGCGCGCT GGAGGCTCAC CACGGCCTCA TGAACCGCGG AGCGGTGGTC
GGAGTGTCGA TCGACAGCAC CTTGGCGATC GTCAGCTACT CTCTCTATCA GCCCGATGCA
GAAGGCGTGG CCCTTCACTG CGAGAGAGCC CTTGCGGCCA TGACCAACGT CCTCCGGGCC
TTCTGCGGCG CGGATTGGGC TCCCGACGAG GTGCTGCTGC CGCGCTCGCA GCCCCCCGAC
ACGACCCCCT ACAGAGACTT CTTCCATGCC CACATCCGGT TCGAGGAGGA GATCGCGGCC
CTGGTTTTCC CGGCCCGGCT CCTGAAGCAC CCCATCGAGG GCGCGAATCC GGTCGCGCGG
AAGGTGGTGG AGCGGCGCAT CCAGCAGCTT GAGGCCGTCA TTCCGGCCGA CGTGACAGAC
GAGCTTCGGC GCCGCCTGCG CGCCACGATG ACCCAGAAGC CGCTCAGCGC GCAGCAGGTC
GCGCGCATGA TGGCGATCCA TCGCCGCACG CTGAGCCGCC GGCTGAAGTC CGAAGGCACG
AGCTTCAGGC TGGTTGCCAA CGAGACGCGG CTTGGCATCG CGAAGCAGTT GTTGGCCGAC
ACCACCCTGA GCCTGGCGCA GATCTCGGCC ACACTGGAAT TCTCGGAGCC GGCCGCATTC
ACGCACGCCT TCCGGCGCTG GACCGGCACA ACGCCGAGCG CTTGGCGGAA GGAAAATCAG
GCACAGGAAA AATCTTAG
 
Protein sequence
MSSFVLSGCN RSLSATAGAE ELRELAHGPP IDFLHINAVQ AICSVLIDFG IDPNRLFEQD 
GISTLFLDGT EVISFASLGR LTALGAHCSQ CPHFGLLVGQ RTTLASLGLL GVLMRNSETI
GDALRALEAH HGLMNRGAVV GVSIDSTLAI VSYSLYQPDA EGVALHCERA LAAMTNVLRA
FCGADWAPDE VLLPRSQPPD TTPYRDFFHA HIRFEEEIAA LVFPARLLKH PIEGANPVAR
KVVERRIQQL EAVIPADVTD ELRRRLRATM TQKPLSAQQV ARMMAIHRRT LSRRLKSEGT
SFRLVANETR LGIAKQLLAD TTLSLAQISA TLEFSEPAAF THAFRRWTGT TPSAWRKENQ
AQEKS