Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_1915 |
Symbol | |
ID | 5744594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 2362169 |
End bp | 2364496 |
Gene Length | 2328 bp |
Protein Length | 775 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 641293012 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001559023 |
Protein GI | 160880055 |
COG category | [K] Transcription |
COG ID | [COG2207] AraC-type DNA-binding domain-containing proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000415573 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGAA TTCCGATTGT AATTCAATTA ATTTTTATCC TGTTATTATT GTTCCTGATA CCAACAGTAA TTACCGGATA TTACAGCAAT GTACAAATGA TGAAGTATTC TGAAGAAGAA ATTGCATATT CTGCTATGGC TCAAATTGAT ACGAGCAGTT CTTATAGCGA AGCTATTTTA ATGAATATAG TTAAGAACAT ATTACAAATG GTTGGAACCA ACGAATTTAA TGGATTGAAA AACATTACTA CATATAAAGC GTTAAACTCA GAATACAGTA AAATAAAAGT TGCCACCGGT ATCTATGATC AGATGAAAGA GATTCAAAAT AACAATAAAA TTGTTAGTTC TATCTATTTT TGTCTCGATG ATGCTGACTA TGTTATCTCA ACGAACCGAG GAGTTGTTAA AAAACAAGCC TATGAAGATA TTTCATGGAT TAATGAAATG GATTTAAAAG CACTGGGTTC TGCTGGTTTA TGGTACCCAA GAACTTATAA CACTGCAACC GTTTCTGAAC TTACAAATAG TAAGTCAACT GGAGAAGTGA GAAATGTCAT TTCCTATATT TATCGCTTAA ATAAACTAAC GACATCAACC AAAGGTACCA TTGTTGTTAA TGTTGATGCA CAACGATTGA ATGAAATATT ACTTTCGAGT GTTAATTCAG ATGATGCACA GGGGATTATA GTAATGCCAG ATGGCACGAT TATATCGCAT AAAGAGAAGT CTAAATTTCT TAAAAAGTTT GAAGACATGG AATTTGTAAA AGATTCTTTA GCGAGCGGTA TAACAACAGG CTATCAATAT CAAAAGGATG GAGATCGTGC TATCTTATTA ACTTTTCAAA AATCATCACA ATTCCAGTGG ATTTATGTCA ACACTTATAA TATGGATACG CTGATGAGTA AATCAGATAG TGTGCGTAAT GGGTATACCA TATTTATCTC AATTATAATA GCTCTTGGGA CTATTGTAGT AATTGTATTA TCTAGAGAGT TTTCTAAACC AATGAGAAAG TTAGTTCAAA ATGTGAAACA GCTCAATGGT ATGGAGCAAC TTGGGGTGAA AAATGAACTT TCTTTTTTAA GTGGTGCAAT TGAAAAAATA CAGGAGCAAG AGAGTGAATT GCATCACCTT TTAAAGGAAA AGGAAGAGGA AGCTAGAAAC TTACTTCTTC ATAATTTACT AACTGGCGAA GTTACGAATC AAAAAGAAAT CGAAAATGTT GAAAAAATAT TTCCATACAA TCATTATATG GTAGCGATAC TATCGATTGA CAATACGAAG CGTTATTTAG AGACGACAGG AAAAGACAAG CGTAGTATTC AACGTTTTGC ACTGCAAGAA AAGATAAAAA GAGTATTTTC TGAAGGGTAT CATGTAGAAT CCATGCGAGA TGGTGCAGGT ATGATGGCTA TCATTATAAA CATGAAATCT TATGATTATG TAAAAGTATC TAGAGAGTTA TTTAATATAT TAACTGGTAT ACGTCAAGAA GCACAGCGTG TCTTTGAATA TACCGTAACG GTTGGAGTTT CTACTGTGCA TAATGGATAT GAGCTGATTA ATGAATGTCA TGTGGAAGCA CTAGAAGCTA TTAAGCGACG TATTATTGTG GGTAGAAATC AAATTATCTT CTGGAATCCG CAAAAAAAAG AGAATAACAA GTATTCCTAT TCTTATAATA GTGAAAAAAA GATACTAAAC TTTCTTTCAT CCGGTGATGC GGATAGTGCG AGAGTAGAGC TTATCAATCT ATTTGATGAT ATTAAGCAAA AAGAAGATAT ATCCTATGAA AATTTGTTAC TGATTTTAAA TCAGCTAGCA GGTGCTACCG TAAAATTCAT GATGGAACAC AATATTAATT CTAGTAAAGT TTTTGGTAAT AACACAAATT TATATCAAAT GATAGGTGGA ATGGATACAT TAGAGGATAT AGAAGCCTAT TTAGGAAAGG TTTTTGTATC CATTACAGAT TATTTAAAGA GTTTTCATGA AGATACATCA GAGAAAAGTT CAGAACTAAT CATTAAGTAT ATAAGAAAAC ACTACAAAGA GGAAATCGTA TTTGAAGATC TTGCAAATCA GATTGGAATT AGTTATTCGT ATATGAGAAA AGTGATACGA GAGGACACCG GGAACAGTTT GATGGACAAT GTAAATCTTT TACGTATTGA TGAGGCAAAA CGTTTATTAC TACACGCGGA TTTGAGCCTT ACTCAGATAG CAACAGAAGT TGGATATCAT AATGTACAGA GCTTGAATCG TTTCTTTAAA AAGTATGAAG GAGTATCTCC GAGTGACTTT AAGAATAATG TAAAATAA
|
Protein sequence | MKRIPIVIQL IFILLLLFLI PTVITGYYSN VQMMKYSEEE IAYSAMAQID TSSSYSEAIL MNIVKNILQM VGTNEFNGLK NITTYKALNS EYSKIKVATG IYDQMKEIQN NNKIVSSIYF CLDDADYVIS TNRGVVKKQA YEDISWINEM DLKALGSAGL WYPRTYNTAT VSELTNSKST GEVRNVISYI YRLNKLTTST KGTIVVNVDA QRLNEILLSS VNSDDAQGII VMPDGTIISH KEKSKFLKKF EDMEFVKDSL ASGITTGYQY QKDGDRAILL TFQKSSQFQW IYVNTYNMDT LMSKSDSVRN GYTIFISIII ALGTIVVIVL SREFSKPMRK LVQNVKQLNG MEQLGVKNEL SFLSGAIEKI QEQESELHHL LKEKEEEARN LLLHNLLTGE VTNQKEIENV EKIFPYNHYM VAILSIDNTK RYLETTGKDK RSIQRFALQE KIKRVFSEGY HVESMRDGAG MMAIIINMKS YDYVKVSREL FNILTGIRQE AQRVFEYTVT VGVSTVHNGY ELINECHVEA LEAIKRRIIV GRNQIIFWNP QKKENNKYSY SYNSEKKILN FLSSGDADSA RVELINLFDD IKQKEDISYE NLLLILNQLA GATVKFMMEH NINSSKVFGN NTNLYQMIGG MDTLEDIEAY LGKVFVSITD YLKSFHEDTS EKSSELIIKY IRKHYKEEIV FEDLANQIGI SYSYMRKVIR EDTGNSLMDN VNLLRIDEAK RLLLHADLSL TQIATEVGYH NVQSLNRFFK KYEGVSPSDF KNNVK
|
| |