Gene Cphy_0694 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0694 
Symbol 
ID5743810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp896904 
End bp900245 
Gene Length3342 bp 
Protein Length1113 aa 
Translation table11 
GC content38% 
IMG OID641291806 
Producthypothetical protein 
Protein accessionYP_001557820 
Protein GI160878852 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.514428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATACTAA AAACATTGTC TGCTCCAATC ATCCTAGAAA ACAGTAACAG TACTTTTACT 
TTCTTACCAG GAGGAGACAA CTTTGAATGG ATTCATGAAT CCATTATGAT CAATGCTTTT
CAAGGTAATA CGCTTGATGG CTCTACCAAC AATTTGTATC TTCGAATATA CAAAGATAAT
TCCCTTGCTT TCTATCCTCT CATCGGGATG AACTCCAAAA GCACTATAAA ATCCGGTACA
TCAACACTTA TTTTTGAAGG AACGGCTGAG GATATTTCCT ATACCGTTAC TTTTCGTTTA
ACCCCATACG GTATTTGGTT CTGGGACATA TCTCTGTCTG GAAATTGCAA CAAAGCAGAT
ATTATCTATT CCCAGGATAT CGGTGTCGGT ACAAAGGGAA GTGTTAATTC CAATGAATTG
TATTTGGCAC AATATTTAGG TCATAGCATC TTTCAAGGAG ATTATGGCTA CGTGATATGT
TCCAGACAAA ATATGGCTCA GGGTGATTTA TTTCCATACC TCCAACAAGG TAGCTTAGGT
ATTCGTTCCA TTGCTTATTC TACCGATGGT ACACAATTTT TTGGTCTATC TTATAAAAAA
ACAAATATCC CTGAAGCATT GTACGGGGAT CTTCCAAGCA AAAATAAGCA GTATGAGTTA
GCTCATACCG CCCTTCAAAC CGAAGCCTTT TCGCTCTCTG GAACAAAACA ATTCTCTTTT
TATGGTATTT GTAAAACTAA TCATCCGGAA GTAATCCGTG AAATAGAGTA CATACAGGAA
TTAGAAAAAG CCTATGCCTA CCATGAATCT GGAGAGATCT TGCCTGTTAA TGTTCCTACC
CTTCAAAACA TAGGGGCTCC TTATGCTTCT TCTCGTTGGG ATGCCAAACA GGTAGAACAT
TATTTCCCAA AGAGACTTCT TGAGGAAAAA GAGGAGGAAG CATTACTCTC CTTTTTCACA
CCAGAAAAAA GTCATGTTGT ATTACAAGAC AAAGAACTAA CAACAGAACG TCCACACGGA
CATATTCTTA TGACAAATTT TGATGTAACT AAGGTTCCTC AAGGTGTAGT AAGTTCTACA
AACTATATGT ATGGTGCATT TAACTGCCAG TTTGTAGTAG GAAATACAAC TTATAATAAA
CTGCTATCCA ATCACAGAGG TTTATTAAAC ATCCAAAAAG ACAGCGGTCA GAGAATCTTT
ATTAAAATTG GCGACTGCTA TAGACAACTT ACCTTACCTG CTGCCTACGA GATGAATGTT
GCGGGCAGTA CCTGGTATTA TCAGCTAGAC GAGGATGTCT TAATAATTAC CTCCTTTGCA
ATGTATAACC GACCGGAAAT TGTACTAAAA GTTCAGTCTC TTGGGCATAA AAAATATGAT
TTTATTGTTA CTCATCAGCT TACGGTAGGA CCGAATGAGT ATGAAAATGA AATCAAATTG
ACAAGAGAGG GTAATATACT ACAGCTTTCC CCAACCGATC CTGTTGTTAC GAATCATTTT
TATCCTGAAC TTTCGTTCCG AATGAGAATA CCAGAGGACT GTACGTTATC AGACGATAGC
ATTTTCTTTC ATAATAATAC GACGATTAAT CCATCTTTAC TATCGATAGA AATACTTCAA
AAATCCAGCT TTGATATTGT AATGCAAGGA TTTGATACAG GAAATGTAAT TCCATTCTTA
GATCAGTATG ACTATAAAGA ACAGCTTGAA GCTTATCGCA TTTACTATGA TCAATTGGTT
TGTAATTTTA AATTAAGTGC TCCAGACAAA ATTCCACTTT CAGCAGAGAA ACTAAATGCA
ATCATACATT GGTATGCTCA TGATGCCTTA ATCCATTTTG CATCTCCACA TGGTCTTGAG
CAATCCGGTG GTGCAGCATG GGGTACACGA GATGTATGCC AAGGCCCGAT AGAATTTTTC
TTAACTACAG GGCATTTTGA CCTCGTACGA CATATTCTGA TAACTCTTTA CTCCCATCAG
ATAGAGGGGG GATTCGAGTG GCCTCAGTGG TTTATGTTTG ATCATTATCC TATACATCAA
GAAGATTGTC ATGGGGATGT TGTATTTTGG CCATTAAAAG CTATCAGTGA TTATATACAG
GCAACCGGAG ACACCTCCAT TCTAAATGAG CTTGTAGATT ACCGCACTGC GAAAGATGCT
TTGCCTACCA ATCAGCCGGA AACAATACTG ATTCATATCA AGCGTGCTGT CACTACAATA
AAGAATCGTT ATCTATCCGG GACAGCGCTA ATTTCCTACG CTGGTGGAGA CTGGGATGAT
ACCTTGCAAC CAGCCAATAG TGAATTAAAA GAAAATCTGG TAAGCGCCTG GACACAAGCT
CTTGCAGAAC AGACCTTAGA ACTTCTTTGC AGTGCTATTA AGGGTATCGA CCATGATTTT
TCGAAAGAAT TATCTCATAT GGCAAACGAC ATCAGAACAT CCTTCTATCA ATATCTGATA
AAAGACGGGG TTATCGCAGG TTTTCTTTAT AGAGAATCCG AAGAACATAT GAAGTATATG
TTGCATCCGG ATGATACGGA GTCATCCATC CATTATCGAC TACTGCCACT AACCAGAAGT
ATTATAGCAC AATTGGCTGA TTTTAAGTTA GCAACTCGTA ATTTAGAAAT TATCGATGAA
CATCTGGCAT GTCCGGATGG CGTTCGTTTG ATGGACCATC CAGCTAGTTA TAGCGGTGGA
ATCAGTAAAA TATTCTTACG TGCAGAACAA GCAGCAAATG TTGGTAGAGA AATCAGCCTG
CAGTATGTGC ATGCTCACAT TCGCTATATA GAAGCATTAG CAACTATGGG CCTTTCTAAA
AAAGCATGGG ATGCCTTAAT GCGTATTAAT CCGATCTTAT TGACAGATTA TGTACCAAAT
GCCCTTACTC GTCAAAGTAA TGTTTACTTT AGCAGTTCCG AAGGATGCTT TGATGATCGT
TATGAATATG CAAAAAACTT TGATAAACTA AGAACCGGAG ATATTAATGT GAAAGGTGGT
TGGAGGCTCT ATTCCAGCGG TCCTGGTATT TATATCCGAA GAATCATTGC AGATTTGCTA
GGAATTCGTT TCGGCCATAA TGTTATCCAC ATCGATCCTG TTGTTACGAA AGAATTAGAT
GGCGTAACGC TACAATTTAC TTGCTTTGGA AAGACAGTTT TCTTTACCTA TCATGTCGAT
GACACGATGG ATAAACATAT TTGTGTTAAA TCAAATAATA ACATACTGCC TGGAGACAAC
TTAAACAATA TCTACAGAGA CGGTGGCATT CAGATTGCAA AAGATGTTTT CTTATCTGCT
GCTATGAGCG ATAATAATTT TCATATTTAT GTTAAGAACT AA
 
Protein sequence
MILKTLSAPI ILENSNSTFT FLPGGDNFEW IHESIMINAF QGNTLDGSTN NLYLRIYKDN 
SLAFYPLIGM NSKSTIKSGT STLIFEGTAE DISYTVTFRL TPYGIWFWDI SLSGNCNKAD
IIYSQDIGVG TKGSVNSNEL YLAQYLGHSI FQGDYGYVIC SRQNMAQGDL FPYLQQGSLG
IRSIAYSTDG TQFFGLSYKK TNIPEALYGD LPSKNKQYEL AHTALQTEAF SLSGTKQFSF
YGICKTNHPE VIREIEYIQE LEKAYAYHES GEILPVNVPT LQNIGAPYAS SRWDAKQVEH
YFPKRLLEEK EEEALLSFFT PEKSHVVLQD KELTTERPHG HILMTNFDVT KVPQGVVSST
NYMYGAFNCQ FVVGNTTYNK LLSNHRGLLN IQKDSGQRIF IKIGDCYRQL TLPAAYEMNV
AGSTWYYQLD EDVLIITSFA MYNRPEIVLK VQSLGHKKYD FIVTHQLTVG PNEYENEIKL
TREGNILQLS PTDPVVTNHF YPELSFRMRI PEDCTLSDDS IFFHNNTTIN PSLLSIEILQ
KSSFDIVMQG FDTGNVIPFL DQYDYKEQLE AYRIYYDQLV CNFKLSAPDK IPLSAEKLNA
IIHWYAHDAL IHFASPHGLE QSGGAAWGTR DVCQGPIEFF LTTGHFDLVR HILITLYSHQ
IEGGFEWPQW FMFDHYPIHQ EDCHGDVVFW PLKAISDYIQ ATGDTSILNE LVDYRTAKDA
LPTNQPETIL IHIKRAVTTI KNRYLSGTAL ISYAGGDWDD TLQPANSELK ENLVSAWTQA
LAEQTLELLC SAIKGIDHDF SKELSHMAND IRTSFYQYLI KDGVIAGFLY RESEEHMKYM
LHPDDTESSI HYRLLPLTRS IIAQLADFKL ATRNLEIIDE HLACPDGVRL MDHPASYSGG
ISKIFLRAEQ AANVGREISL QYVHAHIRYI EALATMGLSK KAWDALMRIN PILLTDYVPN
ALTRQSNVYF SSSEGCFDDR YEYAKNFDKL RTGDINVKGG WRLYSSGPGI YIRRIIADLL
GIRFGHNVIH IDPVVTKELD GVTLQFTCFG KTVFFTYHVD DTMDKHICVK SNNNILPGDN
LNNIYRDGGI QIAKDVFLSA AMSDNNFHIY VKN