Gene Cphy_3310 details

Gene Information

Locus tag: Cphy_3310
Symbol:
ID: 5741589
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 4028993
End bp: 4030549
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 35%
IMG OID: 641294411
Product: hypothetical protein
Protein accession: YP_001560403
Protein GI: 160881435
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
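
The coordinate and length fields in this record are related by simple arithmetic, which the short Python sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record: Start bp and End bp.
length = gene_length_bp(4028993, 4030549)
print(length, protein_length_aa(length))  # 1557 518
```

Both results agree with the Gene Length and Protein Length fields reported in the record.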


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAAA AGTTTCGTAA GTTGGAAGGT ATCGCAATTG TAGTGTTATT CGTATGTCTG 
ATAGGTCTTT CGGTTATGTT AGTGAATAAG AAAAGGCCAG CGAAGGAGGT GATTTCGGTT
CAGGAAACAA ATCTAGTTTT GTATGAAGGC CCAAAGTCTT TACGTGATGC TACCAAGGAG
GATTTGGTTT CTGCAAACGA GAGCTCTAAA GATTTTTCAC TTCTTCATTG TACCGATACT
AAAATAATGG TAAATGGATA TGACTGTTAT GTATATGATA CAAATGTAAA TCATTCTAGA
AGATGGTATA ACGATTACAT GCCTCCAAAT GCAAGAACTC CAATTTCCTA TTTTGATTTT
GAAGGTATCG TAGAAGTAAC AATTAAGGTA CCTAATATTG ACCTAGAAAC AGCTAGCGTA
AGTCCATTGC AATATGGAAT TATACCAGTA CTTGATGTAG AAAACCATAC CGTTACATTT
ACAATTACAA AACCTGACCA ATATACCATA ATGTTTAATA ACTCACCGGA AAGAGCGGTA
CATTTATTTG CGAATGAAAT AGAGACGAAT ATACCATCCA AAGATGATAA AGATGTTGTA
TATATTGGAC CTGGAGAATG GAATATTGAA AACATTATTC TTGAGGATAA TCAAACACTA
TATATCTCGG GTGGTGCTGT AATTCATGGC ATAGTAAATG CTAATTTTGC TAAGAATATC
ACCGTTCGCG GAAGAGGAAT TATCGATGGT TCTAAATTTA ATGGATGGAA GGGAAAAGAA
GCATACATAC CATTAAAGTT TGATAACTGC GAGAACATCA CGATTAAAGA TATTATAGTA
TTAAATCCGA ATGCTTGGGT ATGTCAGGCA TTTAATTCTA AAAACGGTAC TATAGATAAT
ATAAAAATTA TATCCTCAAG GCCAAATGGG GATGGTATCA CTTTACAGTC CTGCCAAGAT
TACACCGTTA AAAATGGTTT TGTTCGAAGC TGGGATGATT CTTTGGTAAT TAAAAATTAT
GATGATAACT CAAAAAATAT AAAGTTTGAA AATATGCAGC TATGGACGGA TTTTGCGCAG
TCTATGGAGG TTGGATATGA AACGAATAAA GGCAAACGAG AGAATGCTTT TATCTCTGAT
ATCACATTTG AGAATATTAC GGTGCTTCAT AACTTCCATA AGCCTGTAAT ATCAGTTCAT
AATGCAGATG ATGCAGCGAT AAGTGGAATT ACTTTCAAAA ATATTGTAGT TGAAAATGCT
CAGATGGGAA GTGGCGATGG TGACGAGATG CCATACCTCA TTGACATTAA TATTGCCGGG
AGTTCTAATT GGTCCTCAAC TAGAGACCGA GGAACTATCA AAAATGTCGT AATTGACGGA
GTAGATGTGC TTGGTGGTAA AAACTGCTCT TCAAGAATTA AAGGCTTTGA TGCAGAGCAT
AACATTGATG GTGTTATTTT AAAAAATATC AATGTGTTAG GAGAGAAGAT CACGAATTTG
GAACAAGGAA AATTTGAAGT AGATGAAAAG ACTACAAAGA ATATTATCCT TGAGTAA
 
Protein sequence
MSQKFRKLEG IAIVVLFVCL IGLSVMLVNK KRPAKEVISV QETNLVLYEG PKSLRDATKE 
DLVSANESSK DFSLLHCTDT KIMVNGYDCY VYDTNVNHSR RWYNDYMPPN ARTPISYFDF
EGIVEVTIKV PNIDLETASV SPLQYGIIPV LDVENHTVTF TITKPDQYTI MFNNSPERAV
HLFANEIETN IPSKDDKDVV YIGPGEWNIE NIILEDNQTL YISGGAVIHG IVNANFAKNI
TVRGRGIIDG SKFNGWKGKE AYIPLKFDNC ENITIKDIIV LNPNAWVCQA FNSKNGTIDN
IKIISSRPNG DGITLQSCQD YTVKNGFVRS WDDSLVIKNY DDNSKNIKFE NMQLWTDFAQ
SMEVGYETNK GKRENAFISD ITFENITVLH NFHKPVISVH NADDAAISGI TFKNIVVENA
QMGSGDGDEM PYLIDINIAG SSNWSSTRDR GTIKNVVIDG VDVLGGKNCS SRIKGFDAEH
NIDGVILKNI NVLGEKITNL EQGKFEVDEK TTKNIILE
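
The sequences in this record are laid out in space-separated blocks of ten characters. A minimal Python sketch for joining such blocks back into one continuous string, so that its length can be compared with the reported values, might look like the following. `join_blocks` is a hypothetical helper, and only the first protein line is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence."""
    return "".join(text.split())


# First line of the protein sequence from this record.
first_line = "MSQKFRKLEG IAIVVLFVCL IGLSVMLVNK KRPAKEVISV QETNLVLYEG PKSLRDATKE"
residues = join_blocks(first_line)
print(len(residues))  # 60: six blocks of ten residues
```

Applying the same helper to the full gene or protein text would give a length to compare with the Gene Length and Protein Length fields.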