Gene Information |
Locus tag | Cphy_0385 |
Symbol | |
ID | 5742253 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 491883 |
End bp | 494183 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 641291497 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001557511 |
Protein GI | 160878543 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
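The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes the stop codon; both assumptions are inferred from the numbers in this record, not stated by it, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the gene length
    includes exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(491883, 494183)
print(length)                  # 2301
print(protein_length(length))  # 766
```

Both results agree with the Gene Length (2301 bp) and Protein Length (766 aa) fields listed in the record.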
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTATTAC GCAATTTTAA GTTGAAAAAA TCAAGTGTAT ATGTGGTTTT TTTCTTTTCG TATATCTTAA TTTTAATTGT TGCTTTAAGT TGTGGTTTTG CTTATTATGC ACAGATAACG GCGAAAATTA CAAAACAAAC CGAGAATACG AAGCAGTTAT TATTAACAGA ACTCCGTACC AGTGTGGAAT CAAGTACACA ATACATTGAG GAATTATGCA ATGAATTCAC GTTTAATACG AAGCTGGAAC AATTTGCGAA AGGATTACCC TCTGTTACAC TGCATGAGGC AATGCAGGAG TTGACATTGC GACAAAAGCC AGGAGCCCTT CTATTCGACT ATTTTCTGTA CATAAAAGAA ACAGATGAGA TTATCACACC AAACATACGG ATGAAGGCAG ACAAATTTTT TGATATTATG TATTCCTTTG AGGATATAGA TTATAAAGAA TTTGCAGAAC AATTAAAAGG AAATTATTTT AAAAATTATT TACCAATAAT GAAAGTAAAT CAATATGACA AAAATCCGGT GGAGATATTG CCTTTGATCC AAACATTCCC GGTTGGAACC AAGCAGAAGC CACTTGGTCA AGTCATCATA TTCATTAATG CGGAAAAATT ATTTTCAATG GTGGAACAGT TCCATGTTGC GACAGAATCA GATGTCTATG TCTATAATAA AAACAATGTG TGTATATTGT CTAGTTCCGG GGCCCCTGAG TTACCGGTAG AACTAGTCAC TGATAAATCA GCAGGTAATA ATACAAAGGA CAGTATTGTA TTTCGACAAT TATCAGATAC CTTAGGCTGG AAGTTCATCA TTTCTACACC AAAGGATTTA TTCTATTCCG AAAACTACGA ATTTTTACTT CGGATGGTTG GGGTTGCAGT TGTGTATCTG ATAATCGGTA TCGTCTTAGT TTCTTTCCTC GCGGAATACA GCTATAAACC AATTCGTGAA ATTCGAAAAT ATATTGATAA GAATACGAAA GTGGACGGAA TCGTAAAAAA TGAATATGAG GTTATTAAGA CTACCTTAAA AGAGCAAATT ACCAATGGAA GAGAATTATC AGATCTACTT GAAAGTCAAA AGCCTATTGT AAAGCGTGAG GTACTTGGTA AGTTACTGCA TGGAATGGTT ACCGATTATA GTAGCTTAAA GGAGCGTTTT GTTCCGCTTG GAATCACTTT TACAACGAAT CTATTTTATA CGGTTGCCCT TGAAGTATCT GAGGGATGTG AATTCTTTCG CATGGATAAA GCAGAAAATG ATCAGAGTAT GATTCTAGCA AAATTTATCT CAGAGAATGT CGGAGTGGAT TTATTTCAAG AGAAATTTAA CTACTACTAC CTTGATATGG GGCAAAATAC AGCGGTTCTA TTGTTAAATC TTCGTAAGGA AGAAGAACTT GAAGAAGCAG ATGAGATCGT GTATCAAAAA GTATCGGAGC TAATTCAATT TCTGAAAAAG TATTATTCTC TATATATTTA TACTGGCATT AGTAAACAGC ACCATACGCT AAAAGGAATT CAGCTTTGCT TTGACGAATC CAGAAAGGCA CTGGAGCAAC ACAAGCTATA CGGAGGATTT GAACCATATT GTTTCTATGA ATTAGAGAAT CTGGAAGCGG ATTATTATTA CCCGAGTGAG ATGGAATATC AATTATTACG CTATATTAAA ACTGGAGAAA GTGATAAAGC AAAACAATTA TTACAAAGTA TTCTTGATAT CAATGCACGT AAGAAAAAAA TTTCGACTAG TGCAGCAAGA GGACTATTAT TTGAAATTTC AATCAGCTTA 
AAGAAGCTCT TTGATGGTGC GATGATATCA AGAGGTAAGG AGACAAGTAA CCTTCTTGAT TTAGATAATT ATCTAAAGAC ACCGAATCTT TTGGTTGCAT TCGATGATTT TTGTTCTTTG ATTGATTGGT ATCAGAAAGA ACGAGCAGAG GTACCGGTTA GTAATAAAAC AAAAAGATTG GTGGACTCCA TAGCAGAGTT TATTGAACAA AATATTGGTG AGAACTGGCT TGACTTAAAT GGACTTTCTT CAGAGTTCGG GGTAACACCT CAGTATATCT CAAACATATT TAAGAAGTAT AAAGACGAGA ACATTAAGGA CTATATTTCG AAGCTTCGAT TGGCTAAGGC AAAGGAATTA TTAAGAGATA CGGAACTTCC GATTAAAGAA ATAGCAAATC AACTGGGTTA TGTAGGTGAA ATTGGGGTAA TACGTTTATT TAAGAAGTAT GAGGGGATTA CTCCGGGAGA TTATCGTAAT CAGATTCATA ATGTAAATTA A
Protein sequence | MLLRNFKLKK SSVYVVFFFS YILILIVALS CGFAYYAQIT AKITKQTENT KQLLLTELRT SVESSTQYIE ELCNEFTFNT KLEQFAKGLP SVTLHEAMQE LTLRQKPGAL LFDYFLYIKE TDEIITPNIR MKADKFFDIM YSFEDIDYKE FAEQLKGNYF KNYLPIMKVN QYDKNPVEIL PLIQTFPVGT KQKPLGQVII FINAEKLFSM VEQFHVATES DVYVYNKNNV CILSSSGAPE LPVELVTDKS AGNNTKDSIV FRQLSDTLGW KFIISTPKDL FYSENYEFLL RMVGVAVVYL IIGIVLVSFL AEYSYKPIRE IRKYIDKNTK VDGIVKNEYE VIKTTLKEQI TNGRELSDLL ESQKPIVKRE VLGKLLHGMV TDYSSLKERF VPLGITFTTN LFYTVALEVS EGCEFFRMDK AENDQSMILA KFISENVGVD LFQEKFNYYY LDMGQNTAVL LLNLRKEEEL EEADEIVYQK VSELIQFLKK YYSLYIYTGI SKQHHTLKGI QLCFDESRKA LEQHKLYGGF EPYCFYELEN LEADYYYPSE MEYQLLRYIK TGESDKAKQL LQSILDINAR KKKISTSAAR GLLFEISISL KKLFDGAMIS RGKETSNLLD LDNYLKTPNL LVAFDDFCSL IDWYQKERAE VPVSNKTKRL VDSIAEFIEQ NIGENWLDLN GLSSEFGVTP QYISNIFKKY KDENIKDYIS KLRLAKAKEL LRDTELPIKE IANQLGYVGE IGVIRLFKKY EGITPGDYRN QIHNVN
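The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of translation table 11 (which match the standard code; table 11 differs in its permitted start codons) and does not handle alternative start codons, which is harmless here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
# ('*' marks a stop codon). Assumed to apply to translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGTTATTAC GCAATTTTAA GTTGAAAAAA"))  # MLLRNFKLKK
print(translate("CAGATTCATA ATGTAAATTA A"))           # QIHNVN
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence; the final TAA codon is the stop and contributes no residue.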