Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_0342 |
Symbol | |
ID | 5742188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 432371 |
End bp | 434707 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 641291432 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001557468 |
Protein GI | 160878500 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGGAAA AACAAGGTAG CATAAACTAT AAAAAGCAAA TATTCATAAT TATTATGACA GGTGCATTAT TGCCGGTAAT TTTATTGGCC TTGTACTTTT ATTATACCTA TATTCATGAA GTGAGTGACA AAATCGATTT ATCGAATGCC AAAGCGTTAG AACAAGTTAG GAACAAAGTA GAAAATGTAA CAGAGACGAT GCAATTAAAC TACATGTCTT CCCTTTGGAA GGGTGATTAT GTAGTGAAAG AGGATTCTAT CGATTGGTTT ATGAAAAACA TTGTTTCTTA CAGCGATTAT TCTAATTTAA AGAAAGTACA GGATGAACTC TTAGGTGCAA CCTACTTAAG GATCTATATC AATAGTTTTT ATTTTATTAA TTTCTCAACT GGTTGGGTTT TAACAAGCAA TGGTATGTAT CCTTTAAATG AAGTGGTAAA TCAAGAACAA TTGTATGATC TTTTCTATAC TGGGATGAGT GACTGCAAGT GGACATATCT AAGTATAGAA AATGAAGGGA AGGGAAATCG TGTTATTAAA TCCGACTATA TTACGCTAGC GATTAAGGCT CCTCTCAACC TTAAAAGCCC AAATTGCTTG TTGTTAGTTA ATTTAAGTAG CGTAGGGTTT GAAAGACTTC TTAGAAATAG CTTAAGCAGT GGAAAATTGA CCGTTTTTGA TGCACAGGGA AAACTCTTAT TCACAGAGCA TAAAGAAGTT TCAGAATTTC TTGATGAGTA CATAAAAGAA CATAATACAA CAATAGAATC CTTGGTAGAA AGGCGAGATG AACTTAGTAA GAAGAGTGGT TTTAATTTGA ACTTCTCCTA TTCTGGAGCC AATGGATGGC TTTATGTATC CAGTTATGAT CCAGCTATTG TGACTCGTGA GGCAAGTTCT ATTATAATAA CTGCCGGACT CTTAATACTA TTTATGATGA CACTTATTTT TCTTTTCTCT GTCGTAGGTT CTCGAACAAT TTATCGTCCA GTATCTCAGT TGGTAACAAA TCTAAGAAAT ATAGAGGAAA TAGAAGAGGT AGCAGGACAA AAAGAGAATG AATTCAAATT TATCGGGAAA CGAATTAATC GATTAGTTGA TTCAAAAAAT GATATGAAAG AATTAATACA GGTTCAACAA AAGCAGTTAA AGGAATTGTT CGTCCTTCGT CTATTAAAAG GAGAAGTAAG AGAAGAGAAT ATTGAATCTA ATTTAAAAAA CTTTGGATTC GAAAAAATGA GATATATTTC AATGATTACG ATTAGTGTTT CATCGAAAGT AATAGAGGAA GACTGGGACT ATACAAAACA AGATATGATA AAAATAAGTA TCGTGGAAAA CATGCCGGAA TTTATTAGTG ACCAGCTTTT ATTTCCTACC GTTACAAATT CAAATGTAAT TCTTTGCACA ATTACGAGAG AGTTAAAAGA AGAACTTGAA GCGGCATCAA TTGAATTGAT TCGCCACCTT ACCGATTATA TCGTGATGGA AAGTGGATAT CATGCAAACT TTGGCATCAG CCAACCTTTT GACAGCCTTA TGAAATTCAG ACAGGCTTAT AATGAAAGTT TAGAAGCGGT CAAAAATAAT GAAACCTTAA ATAGAGAAAA TACCATATGT GAGCAGCAAA ACTTTATATT TTATTCTGAT ATAGCGAATA GTAACTCGAA TAACTATACC TATGAACTTA TTTTAGAGAA AGAGATGAAG GCAGCAGTGG ATGAGTGCGA TAAAGAGAAA GCATTTTTAT TGGCAGATAC TTTTATTAAC CACATGGCAG AAAGCGGTGC TGTTCTAAAT GAGCTTCATT TTTATCTTCA TCGATTCATG GTAGCAGTAA TATTAGTAGC AACAGATGCC GGTATTTCGG TAAATGATAT TTTGGGTCAT GGATCGGTAA ATGTATTTTT GCAATTTAAT CAACTTTCGG ATTTAGATAA AATTCGAAGC TTTTATAAGC ATAATATTAT TTCACCGGTG ATTAAACAGT TAAGTACATT CCGAAGAAGT AATTCTGATC TTGTATTAGA TAAGATAGTT AAGTTGGTGA GAGAAACCTC TAGCGATATC ACATTAACAG AATGTGCAGA TCAACTGGGG TATCATCCAA GTTATATTTG GAAGATTATG AAGAGTAAAA TTAATATGAC GTTTACCGAT TATGTAACAA TACAAAGACT TGAAATTGCG AAGAAAATGC TATTAGAAAC GGACAAATCG GTAGCAGAAA TTGCAGAACA ATTAAAATAC ACCAATGCAC AGAACTTTAT ACGTTTTTTT AGTAAACATG TTGGGACAAC ACCGGGAAAA TTTAGACTGA TGGACAGAAA TTCATAA
|
Protein sequence | MEEKQGSINY KKQIFIIIMT GALLPVILLA LYFYYTYIHE VSDKIDLSNA KALEQVRNKV ENVTETMQLN YMSSLWKGDY VVKEDSIDWF MKNIVSYSDY SNLKKVQDEL LGATYLRIYI NSFYFINFST GWVLTSNGMY PLNEVVNQEQ LYDLFYTGMS DCKWTYLSIE NEGKGNRVIK SDYITLAIKA PLNLKSPNCL LLVNLSSVGF ERLLRNSLSS GKLTVFDAQG KLLFTEHKEV SEFLDEYIKE HNTTIESLVE RRDELSKKSG FNLNFSYSGA NGWLYVSSYD PAIVTREASS IIITAGLLIL FMMTLIFLFS VVGSRTIYRP VSQLVTNLRN IEEIEEVAGQ KENEFKFIGK RINRLVDSKN DMKELIQVQQ KQLKELFVLR LLKGEVREEN IESNLKNFGF EKMRYISMIT ISVSSKVIEE DWDYTKQDMI KISIVENMPE FISDQLLFPT VTNSNVILCT ITRELKEELE AASIELIRHL TDYIVMESGY HANFGISQPF DSLMKFRQAY NESLEAVKNN ETLNRENTIC EQQNFIFYSD IANSNSNNYT YELILEKEMK AAVDECDKEK AFLLADTFIN HMAESGAVLN ELHFYLHRFM VAVILVATDA GISVNDILGH GSVNVFLQFN QLSDLDKIRS FYKHNIISPV IKQLSTFRRS NSDLVLDKIV KLVRETSSDI TLTECADQLG YHPSYIWKIM KSKINMTFTD YVTIQRLEIA KKMLLETDKS VAEIAEQLKY TNAQNFIRFF SKHVGTTPGK FRLMDRNS
|
| |