Gene Cphy_3550 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3550 
Symbol 
ID5742954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4383170 
End bp4384411 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content36% 
IMG OID641294661 
Producthypothetical protein 
Protein accessionYP_001560638 
Protein GI160881670 
COG category[S] Function unknown 
COG ID[COG4856] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0348411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG TAATAACTAG AAATATACCG CTTAAAATCA TATCTTTCCT TATCGCAGTG 
TTATGTTGGG TAATGATTAT GAATATTTCA GATCCCTACA TAACCAGTAC GATTGATAAT
ATTGAAGTTA AAACCATTAA CACGGAGACT ATGGAACAAC ATAACAAGCG TTATGATGTA
GAGTCTGGCG ATATTATCTC AATAAAGGTT CGGGGAAAAC GCTCGATTAT CGACGGATTA
AAGAATACTG ACTTTATAGC GGTTGCTGAT TTTAAAGAGA TGTCTATGGT GGATGCTGTA
CCTATTCATG TATCACCAAA GCAATCTTAC AGATACAATG CAGATGAAAT AGAGATTCTA
GAGCAGACAC AGATGATGAA ACTGACACTA GAAGAGTTAG ATAAGCAAAC CTTCCGTGTT
AATGTTAGAC AGACTGGAGA AGCGAAGGCA GGTTTCTATG TTACGGAATT AATCGCTAAT
CCAAGCATTA TCGAAATCTC TGGATCGAAA AGAAAGATTG CTAAAATTAA GGATGTAGTT
GTTGAGGTTA ACGTTGAACA GGTAAGTAAC TCTTATCAAG TTACAAAAAA ACTAGTTGCT
TATGATGAGA ATGGATATAT TATAGACTCT GAAAAACTTG ATTTTGAGAC TAAAGAAGCA
ACGATAGATG TGACTGTGCT ACCAACAAAG ACAATACCAA TTCAAGTATC TGCAGTTGGA
ACTCCTGCAT ATGGCTATAA ATGCACCGAC GTTGTTTGGG AGCCAAAAAC CATTACCATT
GCTGGAGAGC AAAAAGATCT GAATAAGATT TATTGGTTAA AGCAACAGAT AGACATTAGT
GGTAAGAAAG AAACCTTCCC AGAGAAAAGA AATATTGAAA CAATCTTAGA AGATACTTAT
CCAGGAATGT ATACTTTAGT CGATGAAAGT AATACCTTTG ACATCACGGT TAAGATTGAC
CAGTTAGGTA GCAAAGATAT AACGATACCG ACGTCAGACA TTCAGGTTAG AAATTTAGAT
CCTGATTATG AAGTTATTTT TAGGACACTT GGTAATATAA ATGTACGCGT CAGAGGTGTT
TCAGGATCGT TAAATGAGGT ATCTGCATTA ACGATACGGC CATATATAGA TGTAACAAAT
TATGGACTGG GAGTCCATTC GGTTACGGTA CAATATAAAT CAAATGAGGA ACTCACAATT
CAGCCTGTTA CGATTAGTAT TGAAGTGGTG AAGAGAGAAT AG
 
Protein sequence
MKKVITRNIP LKIISFLIAV LCWVMIMNIS DPYITSTIDN IEVKTINTET MEQHNKRYDV 
ESGDIISIKV RGKRSIIDGL KNTDFIAVAD FKEMSMVDAV PIHVSPKQSY RYNADEIEIL
EQTQMMKLTL EELDKQTFRV NVRQTGEAKA GFYVTELIAN PSIIEISGSK RKIAKIKDVV
VEVNVEQVSN SYQVTKKLVA YDENGYIIDS EKLDFETKEA TIDVTVLPTK TIPIQVSAVG
TPAYGYKCTD VVWEPKTITI AGEQKDLNKI YWLKQQIDIS GKKETFPEKR NIETILEDTY
PGMYTLVDES NTFDITVKID QLGSKDITIP TSDIQVRNLD PDYEVIFRTL GNINVRVRGV
SGSLNEVSAL TIRPYIDVTN YGLGVHSVTV QYKSNEELTI QPVTISIEVV KRE