Gene Cphy_3749 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3749 
Symbol 
ID5742948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4599498 
End bp4601201 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content36% 
IMG OID641294861 
Productglycoside hydrolase family protein 
Protein accessionYP_001560835 
Protein GI160881867 
COG category[R] General function prediction only 
COG ID[COG3858] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.634589 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA GAACAAAGCT TGCATTTATT GGAACTATCG CAGTAGTGGT TGTTGTACTT 
GCGATTGTTG TATCTTATAT CGTAGAGAAA AATACACCAA GTAAAGAAGT AAAAGCACTA
TCGGAATTCT ATCAAGTACC GGAAGGGGAA GCGATGGTCA TCATGGATGA TATCGTTTAT
GAGAGAAATG CAAAACTTCT GGATGGTGTT TTATATATGG ATCTTGAAAC CATAAAAGTT
AAGTTCAATC AGAGGTTTTA TTGGGATGCG ATAGAAAATG TATTAATCTA CACAACGCCA
ACAGAAATTA TTAAGGCAGA GGTAGGTACA AAAGATTATC TTGTAAATAA AAATAAAGTA
TCATCAAATT ATCCAATCGT AAAGATGGTG AATACAGAAG TTTATGTAGC ACTACCCTTT
GTTGCAGAAT ATTCAGATAT GCGTTATAAA GCTTATGAAA ATCCGGATTT AGTTGTAATT
CAGTGTAAAT GGGGAGATTA CTTATTTGCT GATGTAGAGA ATGCAACACA GATAAGAACA
GGAGCATCCA TTAAAAGTCC CATATTAAAA GAACTAAATA AAGGGGATCG TGTTTTATTA
ATCAATAATG GCGGAAATCA ACAAAATGGA TTCTTAACCG TTATGACAGA AGAAGGAATT
AGGGGATTCG TCCGTAAAAA GAACCTTTCA AATTCTTACT ACGATAAGGT AACAAGTAAC
TTTGAAGCTC CTGTTTATGA AAGTATCACG AAGGATTATA AGATTAATTT AACTTGGCAT
CAGGTTACCA ATCAAGAGGC TAATAAAAAG TTAGCAGAGG TATTAGATTC TACCAAGGGT
GTTACGACGA TATCGCCAAC TTGGTATCGT ATTAATTCGG CAGAAGGAAC ATTAGCCTCT
TTGGCAAGTG AAAGCTATAT AGAAAAAGCT CATAGTATGG GAATTGAAGT TTGGGCCCTA
GTGGATAATT TTGATCCTAC TGTTGATACG TTTGAAGTAT TATCAAAAAC TTCAAGCAGA
GAGCGTTTGA TTAATGAATT GATAGCACAA GCCATAAAAT ATAACCTCGA TGGTATTAAT
ATTGATTTTG AGAGTTTATC GGTAGAGACT GGCCCACATT ACATTCAATT TTTACGTGAA
TTATCTGTCA AATGCCGAAG CAACCAGATT GTATTATCTT CCGATACTTA TGTTCCTGCA
TCTTACTCTA AGTTCTATGA TAGACAAGAA CAGGGGGCAG TACTTGATTA TGTTATAATT
ATGGCATATG ATGAGCATCA CAGTAAATCA GAAGAAGCTG GTTCTGTTGC ATCTATCGGA
TTTCTACAAA AAGCAATCGA AGATACGCTA CTCCAAGTGC CAAAGGAAAA GCTTATTATG
GGAATACCAT TTTACGCAAG ACTATGGAAG GAATATACGG AACTTGGTAA TCCAGCACTT
GCTTCAGAGG CAGTTAGTAT GACGAGTGCT GAAAAAACGT TAGAAGCGAA CAAAGCAACG
AAGAGCTGGG ACCAAACGAC AGGACAATAC TATGCTGAGT ATGAGAAAGA TGGTGCGAAG
TATAAAATCT GGCTAGAGGA AGAGGAATCC ATCGAAGCTA AGTTGAAACT TATCTCTGAA
GCTGATTTGG CGGGTGTTGC AAGTTGGAGA TTAGGATTTG AAAAACCTAG CATATGGAAT
GTAATTCAGA AATATGTGAA TTAG
 
Protein sequence
MKKRTKLAFI GTIAVVVVVL AIVVSYIVEK NTPSKEVKAL SEFYQVPEGE AMVIMDDIVY 
ERNAKLLDGV LYMDLETIKV KFNQRFYWDA IENVLIYTTP TEIIKAEVGT KDYLVNKNKV
SSNYPIVKMV NTEVYVALPF VAEYSDMRYK AYENPDLVVI QCKWGDYLFA DVENATQIRT
GASIKSPILK ELNKGDRVLL INNGGNQQNG FLTVMTEEGI RGFVRKKNLS NSYYDKVTSN
FEAPVYESIT KDYKINLTWH QVTNQEANKK LAEVLDSTKG VTTISPTWYR INSAEGTLAS
LASESYIEKA HSMGIEVWAL VDNFDPTVDT FEVLSKTSSR ERLINELIAQ AIKYNLDGIN
IDFESLSVET GPHYIQFLRE LSVKCRSNQI VLSSDTYVPA SYSKFYDRQE QGAVLDYVII
MAYDEHHSKS EEAGSVASIG FLQKAIEDTL LQVPKEKLIM GIPFYARLWK EYTELGNPAL
ASEAVSMTSA EKTLEANKAT KSWDQTTGQY YAEYEKDGAK YKIWLEEEES IEAKLKLISE
ADLAGVASWR LGFEKPSIWN VIQKYVN