Gene Cphy_2025 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2025 
Symbol 
ID5743053 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2500417 
End bp2502339 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content35% 
IMG OID641293122 
Productglycoside hydrolase family protein 
Protein accessionYP_001559132 
Protein GI160880164 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00409776 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGCAG ATTTAACATG GTTGGATAAC CCAGAAGTGT TTCGTGTCAA TCAGTTAGCA 
GCTCACAGTG ATCACCCTTT TTACAAGAGT AAGGAGGAGA TGGAACTAAG TACTTCCGAG
GTTCCCTTTA GTCAGAATTT TACCGAGAAA AACTCCTTAC TACAATCGTT AAATGGTACG
TGGCAGTTTC GATATTCTGT GAGTGCAAAA GAAAGACCAG AGTATTTTTA TCAAGAAGAT
TTTATCGGAA GCGACTTCGA TGAAATTGCG GTTCCATGTC ATATTGAGTT AGCAGGTTAT
GATAAAATAC ATTATATTAA TACGATGTAT CCATGGGAAG GCCATTACTA TCGAAGACCG
GCGTATTGTT TAGGAAATGA TAGTTTGAGA GGAACCTTTA GTGAGGCTTC TTATAATCCG
GTTGGATCTT ATCGGAAACG CTTTGATATA GAGAAAGGCT TACTTGGAAA AAGAGTGTGC
ATAAGTTTTG AAGGCGTAGA ACAAGCAATG TATATCTGGT TAAATGGGCA ATTTATTGGT
TATGCAGAAG ATAGTTTTAC CCCTTCCGAA TTTGATTTAA CACCTTACAT AAAGGAAAAA
GATAATATTT TAGCGGTAGA AGTTCATAAA AGAAGTACGG CCGCTTTTCT TGAAGATCAG
GATTTCTTCC GATTCTTTGG TATCTTTCGA AATGTTACCT TGTATGCAAA ACCTGAGATA
CATATTGAAG ATATGTGGGT TAAACCACAA TTAAATGAAG ATAACTCTTC AGGAAAACTT
GGACTTGAGT TAAAGATATC GAATCGGATT GAATTAGGGA CGATAACACT TAAGGTTCAA
GATGATAATC ATAAAATTAT ACTTGAAAAA AGCATAGAAG CGAAAGAAAA AGTTAGTTAT
GAATCAAATG TAATAGAGAA TATTATTCCT TGGTCATATA AAAATCCATA TTTATATACG
GTAAATCTTG AAGTATATGA TAACAGTGGA TGCTTAGTTG AAATTGTACC ATATAGAATA
GGATTTCGAA GGATTGAAAT CAAGGATAAA CTTATACTTC TCAATGGAAA ACGACTAATA
ATTAATGGTG TGAACCGCCA TGAGTGGAGT CAGACCAGTG GAAGATGTAT TAATTTATCT
GATATGCAGA CAGATATGAA ACTGATTCTT CAAAATAATA TTAATGCTGT TCGTACCAGC
CATTATCCAA ATCAGATACC TTGGTATTAT CTTTGTGATG TTAGTGGCAT CTATGTCATG
TCTGAGACGA ACTTAGAATC TCATGGCTCA TGGCAAAAGT TGGGAGCTAT AGAACCTTCT
TGGAATATGC CGGGTAGTAT TCTACAGTGG AAAGAAGCCG TAGTTGACCG TGCGAGAACT
AATTTTGAAA CCTTTAAGAA TCATACCTCA ATACTATTTT GGTCCCTAGG GAATGAATCC
TATGCAGGAG ATAATATCGA GAGTATGAAT CAATTCTTTA AAGAAAATGA TGCTAGTAGA
CTTGTACATT ATGAAGGAGT AGTGAATAAC AGAGCTTATG AATCTACTAT ATCCGATGTA
GAGAGCCGTA TGTATGCGTC TTCTAAACAA ATAGAAGAAT ATTTGGAATC AGATCCAGCA
AAACCTTTTA TTTTATGCGA ATATATGCAT GATATGGGAA ATTCTTTGGG TGGGCTGAAA
TCATATATTG ACTTGATACC TCGTTATGAG ATGTATCAAG GTGGATTTAT TTGGGATTTT
ATCGATCAAG CATTATTAGT AAAGGATGAG ATATCAGGTG AGATGGTACT TCGATACGGA
GGAGACTTCG ATGACAGACC TTCTGACTAT GAATTCTCAG GAGATGGAAT CGTATTTGCA
GATAGAACAG AAAAACCATC AGTACAGGAG GTGAAATATT ACTATGGATT GTATCAATCA
TAA
 
Protein sequence
MKADLTWLDN PEVFRVNQLA AHSDHPFYKS KEEMELSTSE VPFSQNFTEK NSLLQSLNGT 
WQFRYSVSAK ERPEYFYQED FIGSDFDEIA VPCHIELAGY DKIHYINTMY PWEGHYYRRP
AYCLGNDSLR GTFSEASYNP VGSYRKRFDI EKGLLGKRVC ISFEGVEQAM YIWLNGQFIG
YAEDSFTPSE FDLTPYIKEK DNILAVEVHK RSTAAFLEDQ DFFRFFGIFR NVTLYAKPEI
HIEDMWVKPQ LNEDNSSGKL GLELKISNRI ELGTITLKVQ DDNHKIILEK SIEAKEKVSY
ESNVIENIIP WSYKNPYLYT VNLEVYDNSG CLVEIVPYRI GFRRIEIKDK LILLNGKRLI
INGVNRHEWS QTSGRCINLS DMQTDMKLIL QNNINAVRTS HYPNQIPWYY LCDVSGIYVM
SETNLESHGS WQKLGAIEPS WNMPGSILQW KEAVVDRART NFETFKNHTS ILFWSLGNES
YAGDNIESMN QFFKENDASR LVHYEGVVNN RAYESTISDV ESRMYASSKQ IEEYLESDPA
KPFILCEYMH DMGNSLGGLK SYIDLIPRYE MYQGGFIWDF IDQALLVKDE ISGEMVLRYG
GDFDDRPSDY EFSGDGIVFA DRTEKPSVQE VKYYYGLYQS