Gene Cphy_1706 details

Gene Information

Locus tag: Cphy_1706
Symbol:
ID: 5741537
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 2092890
End bp: 2095226
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 27%
IMG OID: 641292806
Product: AraC family transcriptional regulator
Protein accession: YP_001558817
Protein GI: 160879849
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
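
The start, end, and length fields in the record above are mutually consistent, which a short calculation can confirm. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single untranslated stop codon; the variable names are illustrative and are not part of the database.

```python
# Values copied from the record above.
start_bp = 2092890
end_bp = 2095226

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported Gene Length (2337 bp) and Protein Length (778 aa).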


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0404668
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAAAATA CAAATCAAAA CTTTAAGAAT AATAGTTTCT TTAAGAAAGT ACTTTTTAGT 
ATTATAACAC TGATAATTAT AATAATACTA ACCTATGCGA CTGTTACATA CATAGGGATG
AAAAAAACTA TTTTAGATGT GAAAAATTCT GCCAATATGA ATGAGTTGGC ACAAGCGAAT
AGTACAATAA ATTATTTATT TGAAATGACT AAAAATTTGG CCTTATACAT TTACCAAGAT
GAGGATTTGG TTAAGCTTTT GCATATTGAA GATAAAGAAT TTTTAAATAG TTTAGATTAT
ATAAAGTTAC GAACCAAATT GAACACCTAT ACCCATACAT TTGAGTTCTC AGATAGTATT
ATTATATACA ATAGCAAATT AGATTTGATA ACTTCTACGG AATATTCCAT TCAAAACTAT
GATAAACCAT TGGCGAATGC CATTAAAAAA TATATTAACA ATGATATGAA GGAATATGCA
GAGTTTGCTA TTTTAGACTA TTTGGAGGAA GACGGTAAAA AGAATACTGC GTTTTTGTTT
GGCTTAAAGG ACTGGAAGTT TATTACTCCG GATAATCAAA CAACGATTGG TATTCTAATT
AAACCAGAAT GGTTATTCGA TAACTTAGAG ATTATTAATA AAGCTGATGC AAATCAAGAA
AAAGAAATAT TTATATTGGA TAATAAAGGG GAATTATATA GTTCCGATGC AAAGTTAGTA
AAAGATGAGA GTTTAAATGA ATTAAAAAAT ATAGTAATGA ATCAACCTCA GAATAGTGAC
TACTTTGATG TTTATATACA TAATGTAAAA TATAAAGCCA CCTACATAAA AAATGAAATG
TGTAAATGGA AAATTATATC GATTCAGCCT TATAATGTGT TTATGTCACA GTTATATAAA
ATTACGGGAA TATTTATCGC CATAACATTT ACTATCATTT TAATAGCCTT AGGATTAGCT
TTTATTTTTA CAAAACATAT TTATGTGCCT GTGAATAAAG TGGTAAAACA ATTTATCCAT
AAGAATAAAA ACATAAGTAA TGATTTAAAA ATTGAAGACG AACTAGGTTT TATTGTAAAG
AGCTACGAAA ATGCTGTATC CCAGATTTCT ATACAACAAA GTGATTTAAA AAGTTCTAAA
AAATATATAA GAAGTTATTG GATAAAAAGA CTCCTAATGG AAAGTAAAAT GCTCTCTTTA
GAGGAGTTGA AGAAAAATGA TGTAGAAGAA TTATTGAATG TCAATTTATT AGAAGAGTTT
ATCATTATTA TTCTAAATAT CGATGAAAAT GGAGAATTCA ATAGACATAC TATTGAAAAT
CAACGTATAT ACCGCTATGC TATTGAAAAT ATTTCTCAAG AAGTTATAGG TGAATCATTT
CCTTGTAATA TAGTCGATAT GGGTGAGGAG AACCTTGTAG TTTTAGTCAG CCTAAAAGAT
ACAAAGGGAG TAGTAGCTCA AATAGAAGAA TGTGTCAGAA AAATACAGCA GACAGTTGTT
ACTTATTATG AATTCTCTCT ATCAGCAGCA ATTTCTGATA AAATAGAAAA TTATAGCGAT
ATTTCTAAAA GCTATAAAAA AGCTTTACAT TTATTAAGTT ATAAATTGAT CTATGGAAAT
GAATGTATGA TCAAAGAATC AATGCTAGAA GAAATATTCG AATCAAATAA AGAAATGTTT
ATTTTACAGA AAGAGCAAAA ATTAGAAGGT CTTTTCATAA GTGATAAGGA AGAGTTATTC
AGAACAGAAA TAAATGAAAT ATTTGAGACG ATTAAAAACA TGAAATATAG TGAAATTATG
AGTAGTATTA ACTACTTAAG CTTTTTGTTC TATAAGATTA TTAAGATAAA CTTTCCAATG
CAGTTCAATG AACAAATAAA GCAAATGAAT ATATTAAATA AGAATATATT TAACAGTGCT
AGCTTGGAAG AAATTAAAGA AATTTTTATT GAAATTTATA TCAATATTCA TAAAAATCCA
CAAGAGAATA TTTCAACCAC CAATAACATG TTAGTAAGTA CTATTATTCA AATTATTGAA
GAGAATTATA AAGATCCTAA TTTGAGCCAG GAGTGGATTG CATCTACTTT AAAGCTTTCT
TATAGTAATG TCGGAAAGGT ATTTAAATTA GTTGAGAAAG TTTCGATAGC GGAATATGTG
AATAAGGTTC GGTTAAAATA CGCTTGTGAG TTATTGGAAA ATACGAATTA TAGTATAAAT
GACATTTTTA ATAGTGTAGG GTTTGTAAAT CAAAGTTATT TTTTCACCTT ATTCAAAAAA
TATTTTGGAT GTACTCCGAA GCAATATCAA TTACAGAAGA AGTTTAAACA ATTTTGA
 
Protein sequence
MENTNQNFKN NSFFKKVLFS IITLIIIIIL TYATVTYIGM KKTILDVKNS ANMNELAQAN 
STINYLFEMT KNLALYIYQD EDLVKLLHIE DKEFLNSLDY IKLRTKLNTY THTFEFSDSI
IIYNSKLDLI TSTEYSIQNY DKPLANAIKK YINNDMKEYA EFAILDYLEE DGKKNTAFLF
GLKDWKFITP DNQTTIGILI KPEWLFDNLE IINKADANQE KEIFILDNKG ELYSSDAKLV
KDESLNELKN IVMNQPQNSD YFDVYIHNVK YKATYIKNEM CKWKIISIQP YNVFMSQLYK
ITGIFIAITF TIILIALGLA FIFTKHIYVP VNKVVKQFIH KNKNISNDLK IEDELGFIVK
SYENAVSQIS IQQSDLKSSK KYIRSYWIKR LLMESKMLSL EELKKNDVEE LLNVNLLEEF
IIIILNIDEN GEFNRHTIEN QRIYRYAIEN ISQEVIGESF PCNIVDMGEE NLVVLVSLKD
TKGVVAQIEE CVRKIQQTVV TYYEFSLSAA ISDKIENYSD ISKSYKKALH LLSYKLIYGN
ECMIKESMLE EIFESNKEMF ILQKEQKLEG LFISDKEELF RTEINEIFET IKNMKYSEIM
SSINYLSFLF YKIIKINFPM QFNEQIKQMN ILNKNIFNSA SLEEIKEIFI EIYINIHKNP
QENISTTNNM LVSTIIQIIE ENYKDPNLSQ EWIASTLKLS YSNVGKVFKL VEKVSIAEYV
NKVRLKYACE LLENTNYSIN DIFNSVGFVN QSYFFTLFKK YFGCTPKQYQ LQKKFKQF
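
The gene sequence begins with TTG while the protein sequence begins with M. This is expected under translation table 11, where TTG can act as an initiator codon and is then read as methionine. The sketch below illustrates this on the first line of the gene sequence. The `translate` helper and the `START_CODONS` set are hypothetical names written for this illustration; the start set is only a subset of the initiators table 11 permits, and the code assumes the fragment begins at the true start of the CDS.

```python
from itertools import product

# Amino acid assignments in NCBI order (bases T, C, A, G); table 11 shares
# these with the standard code and differs only in its initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna):
    """Translate a CDS fragment, reading a start codon at position 0 as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codons are decoded as methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from above.
first_line = "TTGGAAAATA CAAATCAAAA CTTTAAGAAT AATAGTTTCT TTAAGAAAGT ACTTTTTAGT"
print(translate(first_line))
```

The result can be compared against the first 20 residues of the protein sequence listed above.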