Gene Cphy_3049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3049 
Symbol 
ID5743375 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp3727316 
End bp3728323 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content36% 
IMG OID641294150 
ProductMerR family transcriptional regulator 
Protein accessionYP_001560145 
Protein GI160881177 
COG category[K] Transcription 
COG ID[COG0789] Predicted transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGAAC TTAAAACAAT TACACAAGTA ACAAAAGCAT TTGGCGTTTC GACAAGGATG 
CTTCGCTACT ATGAACAAAT AGGATTATTG CAAAGTCAGC GTATGGACGG CTATTCCTAT
AGAGTATACA GCGAAGAATC CTGTCTACGG CTTAATCAGA TTATCATTTT ACGCAAATTG
CGCATACCTC TAAAGCAAAT AGGTATATTA CTTAATGATA GTAATACAGC GCATGCCATC
GAGGTGTTCC TAGAAAATAT CCATGAGCTG GATGAAGAAA TCGATTCTTT ATCAACGATT
CGGAATATTC TTAAAAGATT TGTGGATGAA CTGCGTTCTA AATCCGGAGT TAATTTAAAA
TCTGATTTAC TGAATGATAT TACCTTGCTG TCAATGATTT CTCCTCTCTC TTTATCAAAG
ATTAATTTTA AGGAGGAAAA CACAATGGAA GATTTAAACA AAGCAAGTGA GCAATTAAGC
AAACTAAGAG ACCGAGATGT ACGAATCGTA TATCTTCCGC CTGCAACTGT AGCAACTTAT
CAGTATGAAG GTGATGAACC TGAAATGCAT GTCAATCAGG TAATTGATCA GTTTGTACGA
GACAATGATT TAATACATAA AAAAACTGAC TTAAGACATT TTGGATTTAA TTCCCCATGT
CCTGTGGATG GAACCGAATA TCATGGTTAT GAGATGTGGA TTACTGTCCC TGATGACATA
GTAATTCCTG AACCACTGAC TAAAAAGCAC TTTGAAGGCG GTTTATATGC TGCGTATATG
ATTCCTTTCG GAGCATTTGA GGAATGGGGA CGCTTAAATG AATGGGTACA AAACAGCAGC
TTATATGAGT ACAATGGAAA TTGGGACTCT AATAATATGT TTGGCTGGCT TGAAGAGCAT
TTGAATTACA TAAATCATGT TATGTTAGAA AATTCCGAGC CGGAAGGCTT ACAGCTCGAT
TTGTTAATAC CTATTAAGGA AAGAAACTCT GAGTTAAGTT CTAAATAA
 
Protein sequence
MEELKTITQV TKAFGVSTRM LRYYEQIGLL QSQRMDGYSY RVYSEESCLR LNQIIILRKL 
RIPLKQIGIL LNDSNTAHAI EVFLENIHEL DEEIDSLSTI RNILKRFVDE LRSKSGVNLK
SDLLNDITLL SMISPLSLSK INFKEENTME DLNKASEQLS KLRDRDVRIV YLPPATVATY
QYEGDEPEMH VNQVIDQFVR DNDLIHKKTD LRHFGFNSPC PVDGTEYHGY EMWITVPDDI
VIPEPLTKKH FEGGLYAAYM IPFGAFEEWG RLNEWVQNSS LYEYNGNWDS NNMFGWLEEH
LNYINHVMLE NSEPEGLQLD LLIPIKERNS ELSSK