Gene Information |
Locus tag | Cphy_1706 |
Symbol | |
ID | 5741537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 2092890 |
End bp | 2095226 |
Gene Length | 2337 bp |
Protein Length | 778 aa |
Translation table | 11 |
GC content | 27% |
IMG OID | 641292806 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001558817 |
Protein GI | 160879849 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
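The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Cphy_1706.
length_bp = gene_length(2092890, 2095226)
print(length_bp, protein_length(length_bp))  # 2337 778
```

Both results match the Gene Length and Protein Length fields listed for this locus.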
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0404668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAAAATA CAAATCAAAA CTTTAAGAAT AATAGTTTCT TTAAGAAAGT ACTTTTTAGT ATTATAACAC TGATAATTAT AATAATACTA ACCTATGCGA CTGTTACATA CATAGGGATG AAAAAAACTA TTTTAGATGT GAAAAATTCT GCCAATATGA ATGAGTTGGC ACAAGCGAAT AGTACAATAA ATTATTTATT TGAAATGACT AAAAATTTGG CCTTATACAT TTACCAAGAT GAGGATTTGG TTAAGCTTTT GCATATTGAA GATAAAGAAT TTTTAAATAG TTTAGATTAT ATAAAGTTAC GAACCAAATT GAACACCTAT ACCCATACAT TTGAGTTCTC AGATAGTATT ATTATATACA ATAGCAAATT AGATTTGATA ACTTCTACGG AATATTCCAT TCAAAACTAT GATAAACCAT TGGCGAATGC CATTAAAAAA TATATTAACA ATGATATGAA GGAATATGCA GAGTTTGCTA TTTTAGACTA TTTGGAGGAA GACGGTAAAA AGAATACTGC GTTTTTGTTT GGCTTAAAGG ACTGGAAGTT TATTACTCCG GATAATCAAA CAACGATTGG TATTCTAATT AAACCAGAAT GGTTATTCGA TAACTTAGAG ATTATTAATA AAGCTGATGC AAATCAAGAA AAAGAAATAT TTATATTGGA TAATAAAGGG GAATTATATA GTTCCGATGC AAAGTTAGTA AAAGATGAGA GTTTAAATGA ATTAAAAAAT ATAGTAATGA ATCAACCTCA GAATAGTGAC TACTTTGATG TTTATATACA TAATGTAAAA TATAAAGCCA CCTACATAAA AAATGAAATG TGTAAATGGA AAATTATATC GATTCAGCCT TATAATGTGT TTATGTCACA GTTATATAAA ATTACGGGAA TATTTATCGC CATAACATTT ACTATCATTT TAATAGCCTT AGGATTAGCT TTTATTTTTA CAAAACATAT TTATGTGCCT GTGAATAAAG TGGTAAAACA ATTTATCCAT AAGAATAAAA ACATAAGTAA TGATTTAAAA ATTGAAGACG AACTAGGTTT TATTGTAAAG AGCTACGAAA ATGCTGTATC CCAGATTTCT ATACAACAAA GTGATTTAAA AAGTTCTAAA AAATATATAA GAAGTTATTG GATAAAAAGA CTCCTAATGG AAAGTAAAAT GCTCTCTTTA GAGGAGTTGA AGAAAAATGA TGTAGAAGAA TTATTGAATG TCAATTTATT AGAAGAGTTT ATCATTATTA TTCTAAATAT CGATGAAAAT GGAGAATTCA ATAGACATAC TATTGAAAAT CAACGTATAT ACCGCTATGC TATTGAAAAT ATTTCTCAAG AAGTTATAGG TGAATCATTT CCTTGTAATA TAGTCGATAT GGGTGAGGAG AACCTTGTAG TTTTAGTCAG CCTAAAAGAT ACAAAGGGAG TAGTAGCTCA AATAGAAGAA TGTGTCAGAA AAATACAGCA GACAGTTGTT ACTTATTATG AATTCTCTCT ATCAGCAGCA ATTTCTGATA AAATAGAAAA TTATAGCGAT ATTTCTAAAA GCTATAAAAA AGCTTTACAT TTATTAAGTT ATAAATTGAT CTATGGAAAT GAATGTATGA TCAAAGAATC AATGCTAGAA GAAATATTCG AATCAAATAA AGAAATGTTT ATTTTACAGA AAGAGCAAAA ATTAGAAGGT CTTTTCATAA GTGATAAGGA AGAGTTATTC AGAACAGAAA TAAATGAAAT ATTTGAGACG ATTAAAAACA TGAAATATAG TGAAATTATG 
AGTAGTATTA ACTACTTAAG CTTTTTGTTC TATAAGATTA TTAAGATAAA CTTTCCAATG CAGTTCAATG AACAAATAAA GCAAATGAAT ATATTAAATA AGAATATATT TAACAGTGCT AGCTTGGAAG AAATTAAAGA AATTTTTATT GAAATTTATA TCAATATTCA TAAAAATCCA CAAGAGAATA TTTCAACCAC CAATAACATG TTAGTAAGTA CTATTATTCA AATTATTGAA GAGAATTATA AAGATCCTAA TTTGAGCCAG GAGTGGATTG CATCTACTTT AAAGCTTTCT TATAGTAATG TCGGAAAGGT ATTTAAATTA GTTGAGAAAG TTTCGATAGC GGAATATGTG AATAAGGTTC GGTTAAAATA CGCTTGTGAG TTATTGGAAA ATACGAATTA TAGTATAAAT GACATTTTTA ATAGTGTAGG GTTTGTAAAT CAAAGTTATT TTTTCACCTT ATTCAAAAAA TATTTTGGAT GTACTCCGAA GCAATATCAA TTACAGAAGA AGTTTAAACA ATTTTGA
|
Protein sequence | MENTNQNFKN NSFFKKVLFS IITLIIIIIL TYATVTYIGM KKTILDVKNS ANMNELAQAN STINYLFEMT KNLALYIYQD EDLVKLLHIE DKEFLNSLDY IKLRTKLNTY THTFEFSDSI IIYNSKLDLI TSTEYSIQNY DKPLANAIKK YINNDMKEYA EFAILDYLEE DGKKNTAFLF GLKDWKFITP DNQTTIGILI KPEWLFDNLE IINKADANQE KEIFILDNKG ELYSSDAKLV KDESLNELKN IVMNQPQNSD YFDVYIHNVK YKATYIKNEM CKWKIISIQP YNVFMSQLYK ITGIFIAITF TIILIALGLA FIFTKHIYVP VNKVVKQFIH KNKNISNDLK IEDELGFIVK SYENAVSQIS IQQSDLKSSK KYIRSYWIKR LLMESKMLSL EELKKNDVEE LLNVNLLEEF IIIILNIDEN GEFNRHTIEN QRIYRYAIEN ISQEVIGESF PCNIVDMGEE NLVVLVSLKD TKGVVAQIEE CVRKIQQTVV TYYEFSLSAA ISDKIENYSD ISKSYKKALH LLSYKLIYGN ECMIKESMLE EIFESNKEMF ILQKEQKLEG LFISDKEELF RTEINEIFET IKNMKYSEIM SSINYLSFLF YKIIKINFPM QFNEQIKQMN ILNKNIFNSA SLEEIKEIFI EIYINIHKNP QENISTTNNM LVSTIIQIIE ENYKDPNLSQ EWIASTLKLS YSNVGKVFKL VEKVSIAEYV NKVRLKYACE LLENTNYSIN DIFNSVGFVN QSYFFTLFKK YFGCTPKQYQ LQKKFKQF
|
| |
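The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11, TTG is one of the permitted initiation codons and is read as methionine at the start position. The sketch below checks the first and last codons of the sequence shown above. The codon sets are hard-coded from NCBI translation table 11, and the helper name is hypothetical.

```python
# Initiation and stop codons of NCBI translation table 11.
TABLE_11_STARTS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}
TABLE_11_STOPS = {"TAA", "TAG", "TGA"}


def check_cds_ends(first_codon: str, last_codon: str) -> bool:
    """Return True if the CDS opens with a table 11 start codon and closes with a stop."""
    return (
        first_codon.upper() in TABLE_11_STARTS
        and last_codon.upper() in TABLE_11_STOPS
    )


# First and last codons of the Cphy_1706 gene sequence.
print(check_cds_ends("TTG", "TGA"))  # True
```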