Gene Cphy_3820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3820 
Symbol 
ID5744772 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4676284 
End bp4677741 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content38% 
IMG OID641294932 
ProductPGAP1 family protein 
Protein accessionYP_001560906 
Protein GI160881938 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0412] Dienelactone hydrolase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.233276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAGG TATTATTGGG TTTGGTCTGC AGTCTATTTA TCCTAAGTAT GGTCTCGTGT 
AAAGGAAAGA GTAATCAAAC TGACCAGACA ATAAGTGATG GGGAACTGAC AGTGACACCA
ACGATAACAG AAGCAAACAT TACGGAAATA CCTAACGAAA CACCGACACC GGGTCTTATC
GACTCTGTAG AATTTAGTGA GAATGGTTTG GAGGATCTTT CAGCAAAGTT AACGCACGAT
TTATTAACAG AGAATTTTGA AGAAGTCTAT TCCTATCTTG AGGATACAGT CAAAGAGCAA
TTATCATTGC CAGACTTAGA GAAAGCCTTT CATAGCACTG TTGAGCGAAT TGGTGAGTTA
GTTGATGCTA TCTCTATAAA GGCTACAACA ACGGGTGAGT ATATCTCAGT AGATAGTTTA
GTAGAATATA CAGAGAATGG TTTAAAGATA TCTTATGTCT ATAATAAAGA CTGTAAGTTA
GTAAAACTAT GGTTCTCTTA CCAACCAATT GAGGAAGAAT ACGATCGTGA AAAGATGGAG
GAAATCGACA TCACTATAGG TATCACGAGC GGTGTCACTG TTGGTGAGGG AGAATTTCCG
CTTGATGGTA TTTTAACGAT GCCAAAAGGT ATAAAAAATC CACCTGTCGT AGTCTTAGTA
CAGGGGTCAG GACAGAGTGA TATGGATGAA ACCATTGGCG GAACAAGCAA TAAGCCATTT
CGTGATATCG CAAGAGGATT AGCAAGTGAG GGGATTGCCT CTATCCGTTA CAATAAAAGA
TTCTATCAAT ATATGGATCA AGCCTCCGAT ACAATGACAA TTTATGATGA GGTATTAGAG
GATGTTACTT ACGCCATTCA ATATGCTAAG AGTCTAACAA ATGTAAATAC GGAAAAGATA
TTTGTACTTG GGCATAGTTT AGGAGGTATG TTATGTCCAA AGATAGCGGA AGATAATTCG
GATATCGCAG GATTTATCTC CTTAGCAGGA AGTCCAAGAA AATTAGAAGA TCTATTACTT
GATCAATCGA TTGAGGCGGT AGAGAATGGT ACGGTGAGTG AGTCGGAGAA AACACTCTAT
CTAGATACTA TGAAGGCTCA ATATGAGCAG ATTAAAAGCC TTACAGAGGA AAACCTAGAT
GAGCCACTTC TTGGGGCAAA CGGATACTAC TGGAAGAGTT TGAATGATAT AGATACTCCT
AAAATCGTAG CAAATCTAAC CCTGCCAATG TTATTTATGC AGGGAGAAGC AGATTTTCAA
GTGTATCCTG AAGTAGATTT TAAGATGTGG AAGGATCTAC TACAAGAGAA AGACAATGCA
ACATTTCAAT TGTATGAAGG CTTAAATCAT TTATTTATGC CAACAACAGG AGTACGTGAT
ATAAGCGACT ACAGCGTAAA GAACAAGGTA GATGATAAAG TGATTCTAGC AATTGCAGCG
TGGGTTAAGG AACATTAG
 
Protein sequence
MKKVLLGLVC SLFILSMVSC KGKSNQTDQT ISDGELTVTP TITEANITEI PNETPTPGLI 
DSVEFSENGL EDLSAKLTHD LLTENFEEVY SYLEDTVKEQ LSLPDLEKAF HSTVERIGEL
VDAISIKATT TGEYISVDSL VEYTENGLKI SYVYNKDCKL VKLWFSYQPI EEEYDREKME
EIDITIGITS GVTVGEGEFP LDGILTMPKG IKNPPVVVLV QGSGQSDMDE TIGGTSNKPF
RDIARGLASE GIASIRYNKR FYQYMDQASD TMTIYDEVLE DVTYAIQYAK SLTNVNTEKI
FVLGHSLGGM LCPKIAEDNS DIAGFISLAG SPRKLEDLLL DQSIEAVENG TVSESEKTLY
LDTMKAQYEQ IKSLTEENLD EPLLGANGYY WKSLNDIDTP KIVANLTLPM LFMQGEADFQ
VYPEVDFKMW KDLLQEKDNA TFQLYEGLNH LFMPTTGVRD ISDYSVKNKV DDKVILAIAA
WVKEH