Gene Cphy_3504 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3504 
Symbol 
ID5743615 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4322606 
End bp4323628 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content34% 
IMG OID641294614 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_001560592 
Protein GI160881624 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAAG AAAAAACTTT ATTAATAACT GGTGGAACAG GCTCCTTTGG TAATGCTGTG 
CTTGAGAGAT TTCTTAATAC CGACATAAAA GAGATTCGCA TATTCTCTAG AGATGAAAAG
AAACAAGATG ATATGCGACA TAAATATAAT AATGATAAAA TTAAATATTA CATAGGAGAT
GTCAGGGATT TACAAAGCAT TAAAAATGCT ATGCATGGTG TTGATTATGT TTTTCAAGCT
GCAGCTTTAA AGCAAGTACC ATCATGCGAA TTTTTTCCGA TGGAAGCAGT AAAGACTAAC
ATCATAGGTA CAGACAATGT ATTAACAGCT TGTATAGAAG AGGGAGTGAA AAAGGTAATC
TGTTTATCCA CAGATAAAGC AGCTTATCCT GTCAATGCCA TGGGGACGTC GAAGGCTATG
ATGGAAAAGG TTTTTGTAGC GAAGTCGAGA ACAGTAGATC CAAACAAAAC GTTAATATGT
GGAACTCGTT ATGGTAATGT AATGTGCTCA AGGGGATCTG TAATACCATT ATTTATAGAA
CAAATAAAAG CTGGACAACC ACTTACTGTG ACAGAACCCA CAATGACTCG ATTTATTATG
AGCTTAGAAG AGGCTGTTGA GTTGGTTATA TTCGCTTTTC ACCATGCTGA AAGTGGAGAT
ATTATGGTTC AAAAGGCACC GGCAACAACT ATCGGAGTCT TGGCTCAAGC AATAAAAGAA
TTGTTTAATG TCGATAACGA AATAAAAACT ATAGGAATAC GCCATGGAGA AAAAATGTAT
GAAACATTAT TAACTAATGA AGAGTGTGCA CATGCAATAG ATATGGGCAA CTTTTATCGT
GTTCCTGCAG ATAAACGAGA TTTAAATTAT GATAAGTATT TTAAAGTTGG AGATCAAGGA
AGAGAAAAAT TATCTGAATT TAATTCTAAT AATACGCAGC TACTTACTAT AGAACAGACG
AAAGAAAAAT TATTAACATT ATCCTATATA AGAGAAGAAA TAGAAGCTTG GGAGAACCGA
TAA
 
Protein sequence
MFKEKTLLIT GGTGSFGNAV LERFLNTDIK EIRIFSRDEK KQDDMRHKYN NDKIKYYIGD 
VRDLQSIKNA MHGVDYVFQA AALKQVPSCE FFPMEAVKTN IIGTDNVLTA CIEEGVKKVI
CLSTDKAAYP VNAMGTSKAM MEKVFVAKSR TVDPNKTLIC GTRYGNVMCS RGSVIPLFIE
QIKAGQPLTV TEPTMTRFIM SLEEAVELVI FAFHHAESGD IMVQKAPATT IGVLAQAIKE
LFNVDNEIKT IGIRHGEKMY ETLLTNEECA HAIDMGNFYR VPADKRDLNY DKYFKVGDQG
REKLSEFNSN NTQLLTIEQT KEKLLTLSYI REEIEAWENR