Gene Cphy_1711 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_1711 
Symbol 
ID5741462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2101420 
End bp2103282 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content35% 
IMG OID641292811 
Productendopygalactorunase-like protein 
Protein accessionYP_001558822 
Protein GI160879854 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAATA GAATAGACAA TATTTATATG CCAATCGATA TAGAACCTAA TAAAAATTTT 
TCTGTAAAAG TTAGAAGTGA AAACGATCAA CAGTGGCAAG AATTATTTGT TTACAATGTG
AGGGTAGGCC ACCAACAAAC ACCTTATGTA AACAGTGGCA TGGTCAAGTT TGATTTTGAA
GGAGCCATAG AAATTAGTAT AGATTATAAT GTCAGTGATA TAGCATCCTA TGAGATAAGA
CCAACCTCTT ACCATATTAG TGGGAAGCAG GAAGAAAGAA ATATTAAATT CAAGTTACAT
CAGGATGGAG AGAATTCTAA AAAATTAGTT GTAAGAATAA ACGATAATTG GGAGACCGCT
TGTTTGCACA TTTTGTCAAA TCCTATAGAG GAAGAAAAGC CAGTTAAGTA TGCTGAAAAT
ATTCACATAA TAAAGGCAGG TGATGAAATT CCTTTTTATT TACCAAAAGG GAAGGATACC
TATTATTTTG AAGAAGGAAT ACATGTTCTG CCTGGAGGCT TATGGATGGA ACATGATTTA
AAGAGAGTAT ATACAATCGA CAGATTTTTA ATAGAGCAAA GCCCTATTGT TTTATTAGGA
TACGCCGATG GATTAAGTTG TGAGATGCCA CAAAAATACA TAGTGGAAGG CAAAGAAACA
GAGGAAGAAC ATTATAAGAT ATTATTTGAT GGTAGAGATA ACTTGGCGCT TGGCATGATA
GAAGAAAAAA TTGCGTCTAT TAATGTAAGA TATGTAAGAA TACGTTTACT GGGAAGTATA
GGTGAGCGTT TTAGATACTC CAATGCGATA AAACAATTTA GAGTATATAA GGAGAATAGT
CATGAGGATT TAACAGTTCA GGCAGAGACA AGAGCAGCCA CTCCTAGTAT ACTGAATGGA
AAAGGAGTCT CTGAAACAGG ATACAGTAAT TGGCATGCAG CAGAAAGCTT TTTCTTGTGT
CAGGATCATT ATAAAGTGTA TTTAGCAAAT GGATCTGTTG TTAAGGGGGC ATTTGCGTCA
GATGAAGTCA ACCATATTAA AATATATGGC AGAGGTATTT TGGATTGCAC AGAGCTTAAA
CATTTTTTTA GGGTAGGGAG TGAAGATCGT ACAGGTGCTA TATGGCTTAT TAGTGGAAAG
AATTTAGAAG TAGAAGGAAT CACTGTATTA GATCCTCCTA TGTGGTCAAT CGTGTTAAAT
AATGGTGAAA ACATTAAGGT TAGAAATGTA AACCTCATAG CATCAGCGTT AAATGCGGAC
GGTATACATT TTAGCAGTAG TTCCAATGTT GAAATAGAGA ATTGTTTTAT AAGAACTTGT
GATGATTTGA TTGTATTGTA TCATTATGGA AAAGCACAGA ATATTACCGT AAAAAACTGT
GTATTATGGA GTGATGATGG TCATGCGTTT TTGTTTGGTC TAGGAAGTGT GAAAGATGCC
CCTATAAAGA ATATAAAAGT ATATCAATGT GATATTATTG ATCATAGAGC AGCCTGGGAT
TTTATTAAAT ATTCCGGTGC AATTAAGTTG TGGCCAAACG GGGGAAATCT TATGGAGGAT
GTTGTGTTTG ACACGATTAA TATTGATAGT TTTCAAATGC CAGAGAAAGC ATCCGTATTT
AAATTAACTA CTCATGAACG CCTTGAAAAT GAGGGGCATG GCATTTTAAA GAATGTTCTA
CTAAAAGACA TATATTATTG GGGATCAGGA GAGCAAAATG CATTAATCCA AGGAGTTAAT
GAGGCATTTC ATATTGAAAA TGTCAAGATA CAAAACTACT GTAGAAACGG TGTGAGAGTG
AAGGATACGA ATGATGGGCA CATTACAGTA AGTGGTTGTG TTAATGGGTT AACAATAGAG
TGA
 
Protein sequence
MKNRIDNIYM PIDIEPNKNF SVKVRSENDQ QWQELFVYNV RVGHQQTPYV NSGMVKFDFE 
GAIEISIDYN VSDIASYEIR PTSYHISGKQ EERNIKFKLH QDGENSKKLV VRINDNWETA
CLHILSNPIE EEKPVKYAEN IHIIKAGDEI PFYLPKGKDT YYFEEGIHVL PGGLWMEHDL
KRVYTIDRFL IEQSPIVLLG YADGLSCEMP QKYIVEGKET EEEHYKILFD GRDNLALGMI
EEKIASINVR YVRIRLLGSI GERFRYSNAI KQFRVYKENS HEDLTVQAET RAATPSILNG
KGVSETGYSN WHAAESFFLC QDHYKVYLAN GSVVKGAFAS DEVNHIKIYG RGILDCTELK
HFFRVGSEDR TGAIWLISGK NLEVEGITVL DPPMWSIVLN NGENIKVRNV NLIASALNAD
GIHFSSSSNV EIENCFIRTC DDLIVLYHYG KAQNITVKNC VLWSDDGHAF LFGLGSVKDA
PIKNIKVYQC DIIDHRAAWD FIKYSGAIKL WPNGGNLMED VVFDTINIDS FQMPEKASVF
KLTTHERLEN EGHGILKNVL LKDIYYWGSG EQNALIQGVN EAFHIENVKI QNYCRNGVRV
KDTNDGHITV SGCVNGLTIE