Gene Cphy_0338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0338 
Symbol 
ID5742184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp425701 
End bp427869 
Gene Length2169 bp 
Protein Length722 aa 
Translation table11 
GC content37% 
IMG OID641291428 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001557464 
Protein GI160878496 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTT TCGAACAATT AAAAGAAGAG TTAGGAATTA AACTGGAACA GGTCGAAGCT 
ACTGTAAAGC TCATCGATGA GGGAAATACG ATACCTTTTA TTGCTAGATA TCGAAAAGAG
GTTACCGGTT CTTTAAATGA TGAGGTACTA CGAAGTCTAT CAGAGCGTTT AACCTATCTG
CGAAACTTGG AAGAGAAGAA GGCTTCTGTA CTTGCAAGTA TCGAAGAACA GGGAAAATTA
ACAGAAGAGT TAAAGGCACA GATTGAAGCT GCGGCAACCT TAGTAGTAGT AGAAGACTTA
TATCGCCCAT ATCGTCCAAA ACGTAGAACC AGAGCGACAA TTGCAAAAGA GAAGGGATTG
GAGCCACTGG CAGAAATCAT CATTGCTCAG GAAATTTCTA GGGATATCTT AGAATTAGCA
GAAGACTATG TATCGGAAGA GAAGGAAGTA AGTTCTGCTA AAGACGCTGT AGCAGGAGCA
CTGGATATTA TTGCAGAAAG CATCTCTGAT GAAGCGGATT ATCGTATCCA CATACGTAAG
CTTACCTTTG AAGAAGGAAG TATAACTTCC GCCGCAAAGG ATGCAGAAGC CGAATCTGTA
TATGAACTTT ATTATGAATA TAGTGAAAAA ATTTCTAAGG TAGCGGGTCA CAGAGTTTTA
GCCTTAAATC GCGGTGAAAA TGAGAAATTT CTTACAGTAA AGATTACAGC GCCAACTGAG
AAAATAATGA ATTATCTTAA GGGTAAAGTG ATAAAGAGAG ATAATCCACA TACCAATGAT
TATCTTGTAG TAGCAATTGC AGATAGTTAT GACCGTTTAA TCGCACCTGC AATTGAACGT
GAGATCAGAA ACGATTTGAC GGAAAAAGCA GAGGATGGTG CAATTGTAGT CTTTGGCCAG
AACTTAACTC AAGTATTAAT GCAGCCTCCA ATTGCTGGAC AAGTTGTACT TGGATGGGAT
CCAGCGTTTC GTACCGGTTG TAAGCTGGCT GTAGTAGATG CAACAGGAAA GGTATTAGAT
ACTGTAGTAA TCTATCCGAC AGCACCACAA AATAAGGTAG AGGAAGCGAA GACAGTTTTA
AAGAAATTAT TTAAGAAATA TCAGATTACC TTAATTTCTG TTGGAAATGG TACTGCTTCC
AGAGAATCCG AGCAGATTAT TGTAGACTTA TTAAAAGAAA TTACTGAGCC AATCTCTTAT
GTAATTGTAA ATGAGGCTGG AGCATCGGTA TATTCAGCAA GTAAGTTAGC AACCGAGGAG
TTTCCTAATT TTGATGTTGG ACAAAGAAGT GCCGCTTCTA TTGCAAGAAG AATTCAGGAC
CCATTAGCTG AGTTAGTAAA GATTGATCCT AAATCGATCG GCGTAGGTCA ATATCAACAC
GATATGAACC AAAAGAAACT AAGCGAAGCA TTAGGCGGAG TTGTAGAAGA TTGTGTAAAT
AAAGTTGGAG TAGATCTAAA TACTGCATCA GCTTCCTTAC TTGAATATAT TTCTGGAATT
TCAAAACCAA TCGCTAAGAA TATTGTTATT TACCGTGAGG AAAATGGAAA GTTTAAGAAT
CGAAAGCAAT TATTAAAGGT TGCGAAACTT GGGCCAAAGG CTTTTGAACA ATGTGCAGGA
TTTTTACGAA TTAATGATGG AGATAATCCT CTGGATGGAA CTAGTGTGCA TCCAGAATCT
TATGAAGCAG CTGAAAAACT TCTAGAGATG CTTGAAGTGA AAGATTTAAG AGAGTTACAA
GCGAAAGCAA AAGAAGTAGA TGCTAAAAGA GAACAATCGA TTAGCCAGAA AGTTAAGGAT
AAAAAGAAGA TGGCACAGGA GCTTTCGATT GGTGAAATTA CTTTAACTGA TATCATTAAG
GAATTAGAAA AACCAAGCAG AGACCCAAGA GAAGAGATGC CAAAACCAAT CTTAAGAACC
GATGTTCTTG ATATGAAAGA CTTAACTGAG GGTATGATTT TAAAGGGTAC TGTACGTAAT
GTCATTGATT TTGGAGCATT TGTTGATATT GGAGTTCATC AAGATGGTTT GGTTCATATA
TCACAATTGT CAAAGCAGAA ATTTGTTAAG CACCCACTTG ATATCGTTAG TGTAGGAGAT
ATAGTGGAAG TAAAAGTTTT AAGTGTAGAT GTATCGAAAA AAAGAATACA ACTCTCTATG
ATTTTATAA
 
Protein sequence
MKIFEQLKEE LGIKLEQVEA TVKLIDEGNT IPFIARYRKE VTGSLNDEVL RSLSERLTYL 
RNLEEKKASV LASIEEQGKL TEELKAQIEA AATLVVVEDL YRPYRPKRRT RATIAKEKGL
EPLAEIIIAQ EISRDILELA EDYVSEEKEV SSAKDAVAGA LDIIAESISD EADYRIHIRK
LTFEEGSITS AAKDAEAESV YELYYEYSEK ISKVAGHRVL ALNRGENEKF LTVKITAPTE
KIMNYLKGKV IKRDNPHTND YLVVAIADSY DRLIAPAIER EIRNDLTEKA EDGAIVVFGQ
NLTQVLMQPP IAGQVVLGWD PAFRTGCKLA VVDATGKVLD TVVIYPTAPQ NKVEEAKTVL
KKLFKKYQIT LISVGNGTAS RESEQIIVDL LKEITEPISY VIVNEAGASV YSASKLATEE
FPNFDVGQRS AASIARRIQD PLAELVKIDP KSIGVGQYQH DMNQKKLSEA LGGVVEDCVN
KVGVDLNTAS ASLLEYISGI SKPIAKNIVI YREENGKFKN RKQLLKVAKL GPKAFEQCAG
FLRINDGDNP LDGTSVHPES YEAAEKLLEM LEVKDLRELQ AKAKEVDAKR EQSISQKVKD
KKKMAQELSI GEITLTDIIK ELEKPSRDPR EEMPKPILRT DVLDMKDLTE GMILKGTVRN
VIDFGAFVDI GVHQDGLVHI SQLSKQKFVK HPLDIVSVGD IVEVKVLSVD VSKKRIQLSM
IL