Gene Cphy_1915 details


Gene Information

Locus tag: Cphy_1915
Symbol:
ID: 5744594
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2362169
End bp: 2364496
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 11
GC content: 32%
IMG OID: 641293012
Product: AraC family transcriptional regulator
Protein accession: YP_001559023
Protein GI: 160880055
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
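
The coordinate and length fields in this record are related by simple arithmetic, and the sketch below checks that they agree. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence is unspliced and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2362169
END_BP = 2364496


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2328
print(protein_length(length_bp))  # 775
```

Both results match the Gene Length and Protein Length values listed in the record.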


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000415573
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGAA TTCCGATTGT AATTCAATTA ATTTTTATCC TGTTATTATT GTTCCTGATA 
CCAACAGTAA TTACCGGATA TTACAGCAAT GTACAAATGA TGAAGTATTC TGAAGAAGAA
ATTGCATATT CTGCTATGGC TCAAATTGAT ACGAGCAGTT CTTATAGCGA AGCTATTTTA
ATGAATATAG TTAAGAACAT ATTACAAATG GTTGGAACCA ACGAATTTAA TGGATTGAAA
AACATTACTA CATATAAAGC GTTAAACTCA GAATACAGTA AAATAAAAGT TGCCACCGGT
ATCTATGATC AGATGAAAGA GATTCAAAAT AACAATAAAA TTGTTAGTTC TATCTATTTT
TGTCTCGATG ATGCTGACTA TGTTATCTCA ACGAACCGAG GAGTTGTTAA AAAACAAGCC
TATGAAGATA TTTCATGGAT TAATGAAATG GATTTAAAAG CACTGGGTTC TGCTGGTTTA
TGGTACCCAA GAACTTATAA CACTGCAACC GTTTCTGAAC TTACAAATAG TAAGTCAACT
GGAGAAGTGA GAAATGTCAT TTCCTATATT TATCGCTTAA ATAAACTAAC GACATCAACC
AAAGGTACCA TTGTTGTTAA TGTTGATGCA CAACGATTGA ATGAAATATT ACTTTCGAGT
GTTAATTCAG ATGATGCACA GGGGATTATA GTAATGCCAG ATGGCACGAT TATATCGCAT
AAAGAGAAGT CTAAATTTCT TAAAAAGTTT GAAGACATGG AATTTGTAAA AGATTCTTTA
GCGAGCGGTA TAACAACAGG CTATCAATAT CAAAAGGATG GAGATCGTGC TATCTTATTA
ACTTTTCAAA AATCATCACA ATTCCAGTGG ATTTATGTCA ACACTTATAA TATGGATACG
CTGATGAGTA AATCAGATAG TGTGCGTAAT GGGTATACCA TATTTATCTC AATTATAATA
GCTCTTGGGA CTATTGTAGT AATTGTATTA TCTAGAGAGT TTTCTAAACC AATGAGAAAG
TTAGTTCAAA ATGTGAAACA GCTCAATGGT ATGGAGCAAC TTGGGGTGAA AAATGAACTT
TCTTTTTTAA GTGGTGCAAT TGAAAAAATA CAGGAGCAAG AGAGTGAATT GCATCACCTT
TTAAAGGAAA AGGAAGAGGA AGCTAGAAAC TTACTTCTTC ATAATTTACT AACTGGCGAA
GTTACGAATC AAAAAGAAAT CGAAAATGTT GAAAAAATAT TTCCATACAA TCATTATATG
GTAGCGATAC TATCGATTGA CAATACGAAG CGTTATTTAG AGACGACAGG AAAAGACAAG
CGTAGTATTC AACGTTTTGC ACTGCAAGAA AAGATAAAAA GAGTATTTTC TGAAGGGTAT
CATGTAGAAT CCATGCGAGA TGGTGCAGGT ATGATGGCTA TCATTATAAA CATGAAATCT
TATGATTATG TAAAAGTATC TAGAGAGTTA TTTAATATAT TAACTGGTAT ACGTCAAGAA
GCACAGCGTG TCTTTGAATA TACCGTAACG GTTGGAGTTT CTACTGTGCA TAATGGATAT
GAGCTGATTA ATGAATGTCA TGTGGAAGCA CTAGAAGCTA TTAAGCGACG TATTATTGTG
GGTAGAAATC AAATTATCTT CTGGAATCCG CAAAAAAAAG AGAATAACAA GTATTCCTAT
TCTTATAATA GTGAAAAAAA GATACTAAAC TTTCTTTCAT CCGGTGATGC GGATAGTGCG
AGAGTAGAGC TTATCAATCT ATTTGATGAT ATTAAGCAAA AAGAAGATAT ATCCTATGAA
AATTTGTTAC TGATTTTAAA TCAGCTAGCA GGTGCTACCG TAAAATTCAT GATGGAACAC
AATATTAATT CTAGTAAAGT TTTTGGTAAT AACACAAATT TATATCAAAT GATAGGTGGA
ATGGATACAT TAGAGGATAT AGAAGCCTAT TTAGGAAAGG TTTTTGTATC CATTACAGAT
TATTTAAAGA GTTTTCATGA AGATACATCA GAGAAAAGTT CAGAACTAAT CATTAAGTAT
ATAAGAAAAC ACTACAAAGA GGAAATCGTA TTTGAAGATC TTGCAAATCA GATTGGAATT
AGTTATTCGT ATATGAGAAA AGTGATACGA GAGGACACCG GGAACAGTTT GATGGACAAT
GTAAATCTTT TACGTATTGA TGAGGCAAAA CGTTTATTAC TACACGCGGA TTTGAGCCTT
ACTCAGATAG CAACAGAAGT TGGATATCAT AATGTACAGA GCTTGAATCG TTTCTTTAAA
AAGTATGAAG GAGTATCTCC GAGTGACTTT AAGAATAATG TAAAATAA
 
Protein sequence
MKRIPIVIQL IFILLLLFLI PTVITGYYSN VQMMKYSEEE IAYSAMAQID TSSSYSEAIL 
MNIVKNILQM VGTNEFNGLK NITTYKALNS EYSKIKVATG IYDQMKEIQN NNKIVSSIYF
CLDDADYVIS TNRGVVKKQA YEDISWINEM DLKALGSAGL WYPRTYNTAT VSELTNSKST
GEVRNVISYI YRLNKLTTST KGTIVVNVDA QRLNEILLSS VNSDDAQGII VMPDGTIISH
KEKSKFLKKF EDMEFVKDSL ASGITTGYQY QKDGDRAILL TFQKSSQFQW IYVNTYNMDT
LMSKSDSVRN GYTIFISIII ALGTIVVIVL SREFSKPMRK LVQNVKQLNG MEQLGVKNEL
SFLSGAIEKI QEQESELHHL LKEKEEEARN LLLHNLLTGE VTNQKEIENV EKIFPYNHYM
VAILSIDNTK RYLETTGKDK RSIQRFALQE KIKRVFSEGY HVESMRDGAG MMAIIINMKS
YDYVKVSREL FNILTGIRQE AQRVFEYTVT VGVSTVHNGY ELINECHVEA LEAIKRRIIV
GRNQIIFWNP QKKENNKYSY SYNSEKKILN FLSSGDADSA RVELINLFDD IKQKEDISYE
NLLLILNQLA GATVKFMMEH NINSSKVFGN NTNLYQMIGG MDTLEDIEAY LGKVFVSITD
YLKSFHEDTS EKSSELIIKY IRKHYKEEIV FEDLANQIGI SYSYMRKVIR EDTGNSLMDN
VNLLRIDEAK RLLLHADLSL TQIATEVGYH NVQSLNRFFK KYEGVSPSDF KNNVK
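
The two sequences are printed in space-separated blocks of ten characters, so the spaces must be removed before any processing. The sketch below shows one way to clean such a block, translate it, and compute its GC fraction. It builds the codon table from the usual NCBI base ordering (TCAG) and assumes translation table 11 assigns the same amino acids as the standard code, differing only in which start codons are permitted; the helper names are hypothetical. Only the first and last lines of the gene sequence are used here, so the GC value of the fragment is not the whole-gene figure.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_30 = "ATGAAACGAA TTCCGATTGT AATTCAATTA"  # start of the gene sequence
last_18 = "AAGAATAATG TAAAATAA"               # last 18 bases, includes the stop

print(translate(first_30))  # MKRIPIVIQL
print(translate(last_18))   # KNNVK
```

The two outputs match the first ten and last five residues of the protein sequence listed above.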