Gene Cphy_2058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2058 
Symbol 
ID5743758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2542002 
End bp2543411 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content33% 
IMG OID641293155 
Productcellulase 
Protein accessionYP_001559165 
Protein GI160880197 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.33329 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGTGTTG AATTGATGAG AAAGTTTCCA ATTATAGTAT CAGGTATTTT AGTATGTGCA 
ACATTATTAA GTGGCTGTAG CCAAAATATG TCAAATGTTG ATAAATCCAC AGTGAATTTG
AATTCGCAGA ATACAGACAA TCAAAATCAA GATGCAAATA ATAAAGACTC GGATAATCAA
GGCGCAAATA ATCAAGAGTC AAATAATCAA GAATCACAAA ACCCTGAATT AAAAAATACA
CCAACACCTC AAATCATAGT ACTACCAGGT GACAGCACAA TTGAAGCAAA CGAAATTGTA
GCCCCAACCA TCACATTTGA ACAAAAAGAA ATACCATCGA ATCCAGCAAT AACTTTTGTA
CATAATATGA AAATAGGATG GAACCTTGGA AATACATTTG ACGCAGTGAG TGATTCCAAT
CTAATGGATG AACTTAATTA TGAAAGCTCA TGGTGTGGTG TAAAAACAAC AGAAGAGATG
ATGAAAGCAA TTAAAGATGC TGGGTTTCAG TCGATTAGAA TACCAGTATC GTGGCACAAT
CATGTTTCTG GTGATGATTT TATTATAAGC GAAGTATGGC TTAACCGAGT ACAAGAAGTG
GTCGATTATG CTATCAATAA TGATATGTAT GTGATATTAA ATACTCACCA TGATGTAAGT
AAAAATTTTT ATTATCCAAG TAATGAAAAT TTAGAATCTT CTAAAAAATA TATCAACGCA
GTATGGACAC AAGTAAGTGA ACGATTTTCT TCCTATGGAG AAAAGTTATT ATTTGAAGGG
ATGAACGAAC CAAGGCTTGC AGGTTCTAAT TACGAATGGT GGTTAGATTT ATCAAAGCCT
GAGTGTAAAG AAGCAATCGA ATGTATTAAT CAATTAAATC AGGAATTTGT TGATACCGTT
CGCAAATCGG GAGGAGAGAA TACTTCTAGG TATCTTCTGA TACCAGGGTA TGATGCATCG
TCTCAATATG CACTTATTAA TGATTATAAG TTACCAAAAG ATAATATAAA TGATCGTTTA
ATTGTATCAG TACATGCATA TCTACCATAT GACTTTGCCC TAAAAAGTCC AAAGGAAAGT
GGCAGTATAT CAGAATGGAA TTCAAAAATA GCTGGATGTA CTAAGGAGAT AGATTCTTTT
TTAAATAGTT TATATATGAA GTTTATAAAA AATGGAGTTC CCGTAATTAT CGGTGAATTT
GGTGCCAGAG ATAAAGAAAA TAATTTAGAA TCTCGGGTAG AGTATGCTAC TTATTATATA
GGTGCTGCAA AAGCGAATGG AATCACATGC TTCTGGTGGG ATAATCATGC ATTTAAAGGG
GACGGGGAAA ACTTCGGTCT TTTTGATAGA AAGAGTTGTA CTATAAAATA TCCTGAGATA
TTGCAAGGTT TAATGAAATA CGCAGAATAA
 
Protein sequence
MCVELMRKFP IIVSGILVCA TLLSGCSQNM SNVDKSTVNL NSQNTDNQNQ DANNKDSDNQ 
GANNQESNNQ ESQNPELKNT PTPQIIVLPG DSTIEANEIV APTITFEQKE IPSNPAITFV
HNMKIGWNLG NTFDAVSDSN LMDELNYESS WCGVKTTEEM MKAIKDAGFQ SIRIPVSWHN
HVSGDDFIIS EVWLNRVQEV VDYAINNDMY VILNTHHDVS KNFYYPSNEN LESSKKYINA
VWTQVSERFS SYGEKLLFEG MNEPRLAGSN YEWWLDLSKP ECKEAIECIN QLNQEFVDTV
RKSGGENTSR YLLIPGYDAS SQYALINDYK LPKDNINDRL IVSVHAYLPY DFALKSPKES
GSISEWNSKI AGCTKEIDSF LNSLYMKFIK NGVPVIIGEF GARDKENNLE SRVEYATYYI
GAAKANGITC FWWDNHAFKG DGENFGLFDR KSCTIKYPEI LQGLMKYAE