Gene Cphy_3211 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3211 
Symbol 
ID5741989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3915499 
End bp3917145 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content33% 
IMG OID641294311 
Producttwo component AraC family transcriptional regulator 
Protein accessionYP_001560304 
Protein GI160881336 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0042226 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACGTT ACGAATATTG TACTGTCGTC CAAATTCACG GGATTGCGGC AGAATGGAGG 
ATTAACATGT TAAGAGTCTT AGTAGTGGAT GATGAGCCAT ATATCAAACA AGGGTTAGCA
ATGCTTATCA ATTGGATGGC AGAAGGGTTT ACAATCATTG GTGAAGCAGC GAACGGAAAA
GAAGCGATAC GATTTTTACA GAAAACGAAA GTAGATCTTA TCATTGCAGA TATTAAAATG
CCTGAGATGA ATGGTATCGA GCTATTAGAG TACATAAAAA AAGAAAAATT ATCCGATGCA
TCCTATATTA TTCTTAGTGG ATATTATGAC TTTCAATATG CTAGGTCTGC AATCCAACAC
AACTGTTGTG ATTATCTTTT AAAGCCTGTA CAAAAAGATG AATTACTAGA GGTACTTCGT
AGAGTTTCAC AAAAAAGCAA AGAGCTTAAT AATCGGCAAA TAGAAAATAG AGAAAAGGAA
AGAGCACTCT TTGACCGCAA TCTTTTAGCA CTCATCTGGG GAAAATTCGA TAATTACAAT
ATCGAATATG TAACAAAACA TTTAATGCTA TCAAAAGAAA TAAAGTACAT AACTATTAAT
CTTGATTTTA CAAATCAGGA AAATCTAACA GAATCAGATA AACGTAATAT TCAGAAGAAT
CTCTATATGC AGGCAAGAGA GCTACTAAAA GTTGATGAAT ACCACGTTAT ATTTGATGTC
GGTAAACAAG ATTCCTCTTA TGATATTGGT TTTATTTATT GTGATTATCT TGCAAATGAA
GCGGGGGTAA CCGAGGGGGA ATATTTAAAA CAATTCAGCC AACTATTAAG CGAAGGAATC
AACTGCAAAA TTATGCTATT TGTAGGGAAT AAGGTTGATT CGATAGCAAG GCTATCAGAG
TCGTATCAAA CCGCAATGAT TGCAAAGACT TTACAGATGT TTCGAAATGA TAAGGATATT
ACCTTTTATG AAGAAAATAT GAATTTAAAT GGAGCCTCCT TATTAGAAAG AAGAAAATTA
GATATCTTAA TTCAAGCAGT GGAAGAAAAC AATAAAGAGG AAATTTCCTC AATGGTGGAT
AGTATTTATG ATGATATTAA TCAGTCTAAT ATGGATTATA ACCTTATAAA TCTAAACATT
AGTTACATAT TATATCAACT GATTTATTTG GCAACGAAAC AGGATGATAA TATTAATCAA
GAAGAAATCC TACAATATAT TAGTAAAAAT GCCTTCCATA GAGATGCTAT GCGTGGAAGC
CGAAAGCATT TTAAGTTATT TGCTTATGAT TTTTCAGATT ATTTGCAACA ACTTCGTAAA
AATGTCTCGA AAGGTATTCT GGCTAGTATT GAAAGGGAAA TTGAGGAGAA CTATGCTGAG
AATATAAGTT TAAAATCTTT GAGTGAAAAG TATTTCATTA ATAGTGCTTA CTTGGGCCAG
GTATTCAAGA AACAATTTGG CCTACCATTT AAAGATTATT TGAATAATCA TCGTATCGAC
CGGGCAGCGG AGTTATTGCT GCGTACGGAC GAAAAGGTGT ATATCGTAGC TGAATTGGTT
GGTTATCATA ACCTAGATTA CTTCATCAAT CGTTTTGTCT CCGTGAAGGG TTGTACCCCG
ACAAGATATC GAAATCAAAG TAGGTAA
 
Protein sequence
MTRYEYCTVV QIHGIAAEWR INMLRVLVVD DEPYIKQGLA MLINWMAEGF TIIGEAANGK 
EAIRFLQKTK VDLIIADIKM PEMNGIELLE YIKKEKLSDA SYIILSGYYD FQYARSAIQH
NCCDYLLKPV QKDELLEVLR RVSQKSKELN NRQIENREKE RALFDRNLLA LIWGKFDNYN
IEYVTKHLML SKEIKYITIN LDFTNQENLT ESDKRNIQKN LYMQARELLK VDEYHVIFDV
GKQDSSYDIG FIYCDYLANE AGVTEGEYLK QFSQLLSEGI NCKIMLFVGN KVDSIARLSE
SYQTAMIAKT LQMFRNDKDI TFYEENMNLN GASLLERRKL DILIQAVEEN NKEEISSMVD
SIYDDINQSN MDYNLINLNI SYILYQLIYL ATKQDDNINQ EEILQYISKN AFHRDAMRGS
RKHFKLFAYD FSDYLQQLRK NVSKGILASI EREIEENYAE NISLKSLSEK YFINSAYLGQ
VFKKQFGLPF KDYLNNHRID RAAELLLRTD EKVYIVAELV GYHNLDYFIN RFVSVKGCTP
TRYRNQSR