Gene Information |
Locus tag | Cphy_0694 |
Symbol | |
ID | 5743810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 896904 |
End bp | 900245 |
Gene Length | 3342 bp |
Protein Length | 1113 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641291806 |
Product | hypothetical protein |
Protein accession | YP_001557820 |
Protein GI | 160878852 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.514428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACTAA AAACATTGTC TGCTCCAATC ATCCTAGAAA ACAGTAACAG TACTTTTACT TTCTTACCAG GAGGAGACAA CTTTGAATGG ATTCATGAAT CCATTATGAT CAATGCTTTT CAAGGTAATA CGCTTGATGG CTCTACCAAC AATTTGTATC TTCGAATATA CAAAGATAAT TCCCTTGCTT TCTATCCTCT CATCGGGATG AACTCCAAAA GCACTATAAA ATCCGGTACA TCAACACTTA TTTTTGAAGG AACGGCTGAG GATATTTCCT ATACCGTTAC TTTTCGTTTA ACCCCATACG GTATTTGGTT CTGGGACATA TCTCTGTCTG GAAATTGCAA CAAAGCAGAT ATTATCTATT CCCAGGATAT CGGTGTCGGT ACAAAGGGAA GTGTTAATTC CAATGAATTG TATTTGGCAC AATATTTAGG TCATAGCATC TTTCAAGGAG ATTATGGCTA CGTGATATGT TCCAGACAAA ATATGGCTCA GGGTGATTTA TTTCCATACC TCCAACAAGG TAGCTTAGGT ATTCGTTCCA TTGCTTATTC TACCGATGGT ACACAATTTT TTGGTCTATC TTATAAAAAA ACAAATATCC CTGAAGCATT GTACGGGGAT CTTCCAAGCA AAAATAAGCA GTATGAGTTA GCTCATACCG CCCTTCAAAC CGAAGCCTTT TCGCTCTCTG GAACAAAACA ATTCTCTTTT TATGGTATTT GTAAAACTAA TCATCCGGAA GTAATCCGTG AAATAGAGTA CATACAGGAA TTAGAAAAAG CCTATGCCTA CCATGAATCT GGAGAGATCT TGCCTGTTAA TGTTCCTACC CTTCAAAACA TAGGGGCTCC TTATGCTTCT TCTCGTTGGG ATGCCAAACA GGTAGAACAT TATTTCCCAA AGAGACTTCT TGAGGAAAAA GAGGAGGAAG CATTACTCTC CTTTTTCACA CCAGAAAAAA GTCATGTTGT ATTACAAGAC AAAGAACTAA CAACAGAACG TCCACACGGA CATATTCTTA TGACAAATTT TGATGTAACT AAGGTTCCTC AAGGTGTAGT AAGTTCTACA AACTATATGT ATGGTGCATT TAACTGCCAG TTTGTAGTAG GAAATACAAC TTATAATAAA CTGCTATCCA ATCACAGAGG TTTATTAAAC ATCCAAAAAG ACAGCGGTCA GAGAATCTTT ATTAAAATTG GCGACTGCTA TAGACAACTT ACCTTACCTG CTGCCTACGA GATGAATGTT GCGGGCAGTA CCTGGTATTA TCAGCTAGAC GAGGATGTCT TAATAATTAC CTCCTTTGCA ATGTATAACC GACCGGAAAT TGTACTAAAA GTTCAGTCTC TTGGGCATAA AAAATATGAT TTTATTGTTA CTCATCAGCT TACGGTAGGA CCGAATGAGT ATGAAAATGA AATCAAATTG ACAAGAGAGG GTAATATACT ACAGCTTTCC CCAACCGATC CTGTTGTTAC GAATCATTTT TATCCTGAAC TTTCGTTCCG AATGAGAATA CCAGAGGACT GTACGTTATC AGACGATAGC ATTTTCTTTC ATAATAATAC GACGATTAAT CCATCTTTAC TATCGATAGA AATACTTCAA AAATCCAGCT TTGATATTGT AATGCAAGGA TTTGATACAG GAAATGTAAT TCCATTCTTA GATCAGTATG ACTATAAAGA ACAGCTTGAA GCTTATCGCA TTTACTATGA TCAATTGGTT TGTAATTTTA AATTAAGTGC TCCAGACAAA ATTCCACTTT CAGCAGAGAA ACTAAATGCA 
ATCATACATT GGTATGCTCA TGATGCCTTA ATCCATTTTG CATCTCCACA TGGTCTTGAG CAATCCGGTG GTGCAGCATG GGGTACACGA GATGTATGCC AAGGCCCGAT AGAATTTTTC TTAACTACAG GGCATTTTGA CCTCGTACGA CATATTCTGA TAACTCTTTA CTCCCATCAG ATAGAGGGGG GATTCGAGTG GCCTCAGTGG TTTATGTTTG ATCATTATCC TATACATCAA GAAGATTGTC ATGGGGATGT TGTATTTTGG CCATTAAAAG CTATCAGTGA TTATATACAG GCAACCGGAG ACACCTCCAT TCTAAATGAG CTTGTAGATT ACCGCACTGC GAAAGATGCT TTGCCTACCA ATCAGCCGGA AACAATACTG ATTCATATCA AGCGTGCTGT CACTACAATA AAGAATCGTT ATCTATCCGG GACAGCGCTA ATTTCCTACG CTGGTGGAGA CTGGGATGAT ACCTTGCAAC CAGCCAATAG TGAATTAAAA GAAAATCTGG TAAGCGCCTG GACACAAGCT CTTGCAGAAC AGACCTTAGA ACTTCTTTGC AGTGCTATTA AGGGTATCGA CCATGATTTT TCGAAAGAAT TATCTCATAT GGCAAACGAC ATCAGAACAT CCTTCTATCA ATATCTGATA AAAGACGGGG TTATCGCAGG TTTTCTTTAT AGAGAATCCG AAGAACATAT GAAGTATATG TTGCATCCGG ATGATACGGA GTCATCCATC CATTATCGAC TACTGCCACT AACCAGAAGT ATTATAGCAC AATTGGCTGA TTTTAAGTTA GCAACTCGTA ATTTAGAAAT TATCGATGAA CATCTGGCAT GTCCGGATGG CGTTCGTTTG ATGGACCATC CAGCTAGTTA TAGCGGTGGA ATCAGTAAAA TATTCTTACG TGCAGAACAA GCAGCAAATG TTGGTAGAGA AATCAGCCTG CAGTATGTGC ATGCTCACAT TCGCTATATA GAAGCATTAG CAACTATGGG CCTTTCTAAA AAAGCATGGG ATGCCTTAAT GCGTATTAAT CCGATCTTAT TGACAGATTA TGTACCAAAT GCCCTTACTC GTCAAAGTAA TGTTTACTTT AGCAGTTCCG AAGGATGCTT TGATGATCGT TATGAATATG CAAAAAACTT TGATAAACTA AGAACCGGAG ATATTAATGT GAAAGGTGGT TGGAGGCTCT ATTCCAGCGG TCCTGGTATT TATATCCGAA GAATCATTGC AGATTTGCTA GGAATTCGTT TCGGCCATAA TGTTATCCAC ATCGATCCTG TTGTTACGAA AGAATTAGAT GGCGTAACGC TACAATTTAC TTGCTTTGGA AAGACAGTTT TCTTTACCTA TCATGTCGAT GACACGATGG ATAAACATAT TTGTGTTAAA TCAAATAATA ACATACTGCC TGGAGACAAC TTAAACAATA TCTACAGAGA CGGTGGCATT CAGATTGCAA AAGATGTTTT CTTATCTGCT GCTATGAGCG ATAATAATTT TCATATTTAT GTTAAGAACT AA
|
Protein sequence | MILKTLSAPI ILENSNSTFT FLPGGDNFEW IHESIMINAF QGNTLDGSTN NLYLRIYKDN SLAFYPLIGM NSKSTIKSGT STLIFEGTAE DISYTVTFRL TPYGIWFWDI SLSGNCNKAD IIYSQDIGVG TKGSVNSNEL YLAQYLGHSI FQGDYGYVIC SRQNMAQGDL FPYLQQGSLG IRSIAYSTDG TQFFGLSYKK TNIPEALYGD LPSKNKQYEL AHTALQTEAF SLSGTKQFSF YGICKTNHPE VIREIEYIQE LEKAYAYHES GEILPVNVPT LQNIGAPYAS SRWDAKQVEH YFPKRLLEEK EEEALLSFFT PEKSHVVLQD KELTTERPHG HILMTNFDVT KVPQGVVSST NYMYGAFNCQ FVVGNTTYNK LLSNHRGLLN IQKDSGQRIF IKIGDCYRQL TLPAAYEMNV AGSTWYYQLD EDVLIITSFA MYNRPEIVLK VQSLGHKKYD FIVTHQLTVG PNEYENEIKL TREGNILQLS PTDPVVTNHF YPELSFRMRI PEDCTLSDDS IFFHNNTTIN PSLLSIEILQ KSSFDIVMQG FDTGNVIPFL DQYDYKEQLE AYRIYYDQLV CNFKLSAPDK IPLSAEKLNA IIHWYAHDAL IHFASPHGLE QSGGAAWGTR DVCQGPIEFF LTTGHFDLVR HILITLYSHQ IEGGFEWPQW FMFDHYPIHQ EDCHGDVVFW PLKAISDYIQ ATGDTSILNE LVDYRTAKDA LPTNQPETIL IHIKRAVTTI KNRYLSGTAL ISYAGGDWDD TLQPANSELK ENLVSAWTQA LAEQTLELLC SAIKGIDHDF SKELSHMAND IRTSFYQYLI KDGVIAGFLY RESEEHMKYM LHPDDTESSI HYRLLPLTRS IIAQLADFKL ATRNLEIIDE HLACPDGVRL MDHPASYSGG ISKIFLRAEQ AANVGREISL QYVHAHIRYI EALATMGLSK KAWDALMRIN PILLTDYVPN ALTRQSNVYF SSSEGCFDDR YEYAKNFDKL RTGDINVKGG WRLYSSGPGI YIRRIIADLL GIRFGHNVIH IDPVVTKELD GVTLQFTCFG KTVFFTYHVD DTMDKHICVK SNNNILPGDN LNNIYRDGGI QIAKDVFLSA AMSDNNFHIY VKN
|
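Note: the coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence above ends in TAA). The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Cphy_0694.
length_bp = gene_length(896904, 900245)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 3342 1113
```

Both results agree with the Gene Length (3342 bp) and Protein Length (1113 aa) fields listed in the record.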