Gene Cphy_3700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3700 
Symbol 
ID5742724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4549489 
End bp4550538 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content37% 
IMG OID641294810 
ProductLacI family transcription regulator 
Protein accessionYP_001560786 
Protein GI160881818 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000696494 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAAAGG AAAAAATATC AGTACAGGAG AATAAGAAGA TGGCTACGAT TCGAGACATT 
TCGTTAAAAT GTGGGGTATC GGTTTCTACA GTAAGTAAGG TATTAAATGG ATACCGAGAA
ATTGGGGAAG AGACTTCGAA AGCGGTTATG AAAGCAGCAG AGGAGCTTGG GTATGTACCA
AACTCTTATG CCAGACAACT GAAACTTAAA AAATCGTATA ATATAGGTGT ATTATTTGAT
ACCTTATCTG TTTATGGTTT AAAGAATGAA TACTTTGCAC ACATTTTAGC CGCATTGAGA
GAGAATGCGA GCCAAGGGGG CTATGATATC ACCTTTATTG AGAATAATAT AGGGAATCGT
AGAATGACCT ATCTTGAACA CTGTAAATAT CGAAATTTCG ATGGAATTTG TATTGTATGC
GCTGATTTTA CTAATCCTGA GGTATTAGAT GTAGTCAATA GCGATTTTCC AGTGGTAACA
ATTGATCATT CATTTAACGA AGCAATTAGT ATCTTATCAG ATAATTCTGG TGGTATGAGA
GATTTGGCCC AGTATATTGT TTCGATGGGA CATAAGAAAA TAGCTTATAT TCATGGTAAT
AAGAGTTCTG TAACTCATAA TCGTTTGGTC GCATTTCACC AAGTATTAGC GGAACATGAT
ATTGTGATAC CGGAGTATTA CATGAAGGAA GGCGAATATC GCCTTGCTGA GTCAGCAGAG
GAATTTACGT ATGAATTACT AAATCTTTCG GATAGACCAA CTTGTATCTT AGCTTCTGAT
GACTATGCTG CACTAGGCGT GATAAAAGCA ATTAAGCGAG CTGGGCTAAG GTTTCCAGAG
GATATATCGG TAGCAGGGTT TGATGGAATA TCCATCTCTC AAGCGCTTGA GCCTAAGCTA
ACTACAGTAA AGCAGGATAC TGAAAAATTA GGGGAACAGG CAGCAAAAAA ATTAATTTGT
TTAATGGAAA GTCCTATGAC TACCCCTTTG GAGCATATTG TATTAAAGGC AGAATTAATC
ATTGGTGATT CTGTTAAAAA ACTTAGATAA
 
Protein sequence
MSKEKISVQE NKKMATIRDI SLKCGVSVST VSKVLNGYRE IGEETSKAVM KAAEELGYVP 
NSYARQLKLK KSYNIGVLFD TLSVYGLKNE YFAHILAALR ENASQGGYDI TFIENNIGNR
RMTYLEHCKY RNFDGICIVC ADFTNPEVLD VVNSDFPVVT IDHSFNEAIS ILSDNSGGMR
DLAQYIVSMG HKKIAYIHGN KSSVTHNRLV AFHQVLAEHD IVIPEYYMKE GEYRLAESAE
EFTYELLNLS DRPTCILASD DYAALGVIKA IKRAGLRFPE DISVAGFDGI SISQALEPKL
TTVKQDTEKL GEQAAKKLIC LMESPMTTPL EHIVLKAELI IGDSVKKLR