Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mnod_4868 |
Symbol | |
ID | 7303808 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium nodulans ORS 2060 |
Kingdom | Bacteria |
Replicon accession | NC_011894 |
Strand | + |
Start bp | 4956343 |
End bp | 4957440 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643602511 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002500031 |
Protein GI | 220924729 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.608166 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTCCT TTGTCTTGTC GGGTTGCAAT CGATCTCTGA GCGCCACGGC AGGCGCCGAA GAGCTGCGCG AGCTCGCGCA CGGACCCCCC ATTGATTTCT TGCACATCAA CGCAGTTCAG GCGATCTGCA GCGTCCTCAT AGATTTTGGC ATCGATCCAA ACCGGCTGTT CGAGCAGGAC GGAATCAGTA CGTTGTTCCT CGACGGCACC GAAGTGATCT CATTCGCGTC GCTCGGCCGC CTGACGGCTC TTGGTGCCCA TTGCAGCCAA TGCCCCCACT TCGGACTTCT CGTCGGTCAG CGCACCACCC TCGCTTCGCT CGGCCTCCTC GGGGTGCTGA TGCGAAACTC GGAGACGATC GGCGATGCCC TGCGCGCGCT GGAGGCTCAC CACGGCCTCA TGAACCGCGG AGCGGTGGTC GGAGTGTCGA TCGACAGCAC CTTGGCGATC GTCAGCTACT CTCTCTATCA GCCCGATGCA GAAGGCGTGG CCCTTCACTG CGAGAGAGCC CTTGCGGCCA TGACCAACGT CCTCCGGGCC TTCTGCGGCG CGGATTGGGC TCCCGACGAG GTGCTGCTGC CGCGCTCGCA GCCCCCCGAC ACGACCCCCT ACAGAGACTT CTTCCATGCC CACATCCGGT TCGAGGAGGA GATCGCGGCC CTGGTTTTCC CGGCCCGGCT CCTGAAGCAC CCCATCGAGG GCGCGAATCC GGTCGCGCGG AAGGTGGTGG AGCGGCGCAT CCAGCAGCTT GAGGCCGTCA TTCCGGCCGA CGTGACAGAC GAGCTTCGGC GCCGCCTGCG CGCCACGATG ACCCAGAAGC CGCTCAGCGC GCAGCAGGTC GCGCGCATGA TGGCGATCCA TCGCCGCACG CTGAGCCGCC GGCTGAAGTC CGAAGGCACG AGCTTCAGGC TGGTTGCCAA CGAGACGCGG CTTGGCATCG CGAAGCAGTT GTTGGCCGAC ACCACCCTGA GCCTGGCGCA GATCTCGGCC ACACTGGAAT TCTCGGAGCC GGCCGCATTC ACGCACGCCT TCCGGCGCTG GACCGGCACA ACGCCGAGCG CTTGGCGGAA GGAAAATCAG GCACAGGAAA AATCTTAG
|
Protein sequence | MSSFVLSGCN RSLSATAGAE ELRELAHGPP IDFLHINAVQ AICSVLIDFG IDPNRLFEQD GISTLFLDGT EVISFASLGR LTALGAHCSQ CPHFGLLVGQ RTTLASLGLL GVLMRNSETI GDALRALEAH HGLMNRGAVV GVSIDSTLAI VSYSLYQPDA EGVALHCERA LAAMTNVLRA FCGADWAPDE VLLPRSQPPD TTPYRDFFHA HIRFEEEIAA LVFPARLLKH PIEGANPVAR KVVERRIQQL EAVIPADVTD ELRRRLRATM TQKPLSAQQV ARMMAIHRRT LSRRLKSEGT SFRLVANETR LGIAKQLLAD TTLSLAQISA TLEFSEPAAF THAFRRWTGT TPSAWRKENQ AQEKS
|
| |