Gene Cphy_3034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3034 
Symbol 
ID5743360 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3707709 
End bp3709379 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content35% 
IMG OID641294135 
Producttwo component AraC family transcriptional regulator 
Protein accessionYP_001560130 
Protein GI160881162 
COG category[T] Signal transduction mechanisms 
COG ID[COG4753] Response regulator containing CheY-like receiver domain and AraC-type DNA-binding domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00618221 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATAAGC TATTGGTGGT GGAAGACGAA AAATCAATTG CATATGGAAT GGCAAATAGT 
ATTCCATGGG GGGAATGGGG ATTTGAAATT AGCGGTGTCT GCGGAAATGG ATTGGAGGCC
CTTGATCAAA TAAAAAAGGA CAAGCCTCAT GTGATAATTT CAGACATTCG AATGCCAGAA
ATGGATGGAA TAGAGCTAAT GCAGTATCTA AATCAGCATT ACCCGGAGAT TAAAATCATT
ATTCTAAGTG GCTATAATGA TTTTGAATAT ATGCAGATGT CCATTAAAAG CCAGGTTTCT
GAGTACTTAT TAAAACCTAC TGATCTTGAT GAATTCGAAG TTATGTTTCG TAAAATGAAA
GATAGACTTG ATGAAGAAAA TAGAAAAAAA ATAGAGGAAA TTGAGCTAAA ACAGGCTTAT
GAGGAAGGTA AGAGCCTAAA GCTTAGAAAG AAGTTCAATG ATTTGATAAA GGGCTATGGC
TATAACGAAG AAGAGATGGA GGAAGAATTC TTACAACATG ATGGAAACTG GTTTGGTGTT
ATTCGGATTA GTTTCGATGT CCAGAATTCG GTAGATAAGA ATGCCTTTTA TCAGAAGCAA
ATTAAGATAG AGAACCTTCT AAATGAAAAG GCTTCCAAAG AGCAGCAAAT AGTAGGAAAA
TTTATTTTAA ACTTTGAAGA AAAGATAACC GGAATTCTAA AATCTGAAGA AGAACCAGAA
GAAGAAGTCC TTCGTAGCTA TACCAAGAAG ATGTTAGAAT GCGTCATAGG AGAGGGTAAC
ATCAATGCAT ATGCAGGTAT TAGCAACTTC TATGTGGATT TTCAGATGTT ACCTCAGTGT
TATGAACAAG CGAAATGCTG TGTCGGTCAG AAAATCTATA GTGAAGCCAA AAGCCAGATT
ATGTTCTATA AAGAAATGCA GGAAGCAGAT TTTGACTATT ACGCCATTTC TTTTGACGTG
GAGAAGATTC TTAAGGAAAT TTTAGAACAG CAAGAAAAAG AATTAGAAGA GACACTGGAA
CAGATTTTCT CTGAATTCAG AGGGAAGGTG ATTCTTGATT ATGATTGTAT TAACCGTCTA
AGCCTTGAGT TAATGTTTAA CCTATCCCGT GAGCTGTTAC GCTATGGTGT TCAACTTGAG
AAAGTTATGA AAAAGATGGA TTACACATAT ACACACATAT ATACCTTTAA AAGCTTGGAA
GGGAAAAAAG AGTTTCTATT TAAAATTCTT CGAGAAGTTT CCAAAGAAAG TGCTCGAATG
AAGGGAGAAT GGAAAAACAG AAGCAGCCTT GCTCAGCAAA TAAAGGAAAT TGTGGATGCA
GAATACGATT CTAACCAGAT TTCTCTAGAA TATGTTGGTA CCAAGGTGTG TAAGAATACA
GCATACATTT CTAAAATATT TAAAAATGAA TTTGGATGTA ATTTTAGTGA TTATATTATA
TCAAAACGAT TGGAGAAAAG TCAGAAACTT CTGGCAGATC CGGCACTTAA GATATATGAA
ATAGCTGAGG AAATGGGATG GGCTGATGTA TCCAACTATA TTAAGTTATT TAAGAAAAAG
TATGGAATGA GTCCAAAAGA GTATCGTAAT ATCCTCCAAG CAGGAGGTAT CCTCCAAACA
GGAGGTAACC ACCCAAAAGG AGTTGGCCAT GCTAAAGATT CTGAAAATTA A
 
Protein sequence
MYKLLVVEDE KSIAYGMANS IPWGEWGFEI SGVCGNGLEA LDQIKKDKPH VIISDIRMPE 
MDGIELMQYL NQHYPEIKII ILSGYNDFEY MQMSIKSQVS EYLLKPTDLD EFEVMFRKMK
DRLDEENRKK IEEIELKQAY EEGKSLKLRK KFNDLIKGYG YNEEEMEEEF LQHDGNWFGV
IRISFDVQNS VDKNAFYQKQ IKIENLLNEK ASKEQQIVGK FILNFEEKIT GILKSEEEPE
EEVLRSYTKK MLECVIGEGN INAYAGISNF YVDFQMLPQC YEQAKCCVGQ KIYSEAKSQI
MFYKEMQEAD FDYYAISFDV EKILKEILEQ QEKELEETLE QIFSEFRGKV ILDYDCINRL
SLELMFNLSR ELLRYGVQLE KVMKKMDYTY THIYTFKSLE GKKEFLFKIL REVSKESARM
KGEWKNRSSL AQQIKEIVDA EYDSNQISLE YVGTKVCKNT AYISKIFKNE FGCNFSDYII
SKRLEKSQKL LADPALKIYE IAEEMGWADV SNYIKLFKKK YGMSPKEYRN ILQAGGILQT
GGNHPKGVGH AKDSEN