Gene Cphy_3921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_3921 
Symbol 
ID5742047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp4817058 
End bp4818356 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content39% 
IMG OID641295037 
ProductDNA-directed DNA polymerase 
Protein accessionYP_001561007 
Protein GI160882039 
COG category[L] Replication, recombination and repair 
COG ID[COG0389] Nucleotidyltransferase/DNA polymerase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.019973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTAGAT TTGTCCATAT TTTGAGTGGC AGTGTAGCAA TGCAGGAAAA GATTTTTTTT 
CATATTGATG TTAACTCTGC GTTTCTTAGC TGGGAGGCTG CTTATCGACT TCATGTTTTA
GGGGAAAGCG TGGACTTGCG AGAAATCCCT TCTGTAATTG GAGGAGATAA AGAAAATAGG
CATGGAATAG TTTTAGCGAA ATCTACTTCT GCTAAAAAAC TAAAAATTCA CACAGGTGAA
GCTCTAGGGG CAGCAGTACA AAAATGCCCA AATTTAGTAA TCATTCCACC AAATTACCAA
AGGTATGTAA AAGCATCAAA ATCATTAATG GAGATTTTAC ACAGGTTTTC TCCGAAAGTA
GAGCAGTATT CGATTGATGA AGCTTTTGTT GATATGAGTG GGAGTGAATT GTTATATGGG
GGACCTGTTA TTGTTGCAAA TAATCTAAAA GATTTGATTG AAGAGGAGTT AAAATTTACG
GTAAATATTG GTGTGTCTTC GAACAAATTA TTAGCAAAGA TGGCAGGGGA GTTAAAGAAG
CCGAATTTGG TGCATACGAT GTTTCCAGAG GAGATTCCAA AGAAGATGTG GCCGTTACCG
GTAGGAGAGT TATTTTTTGT AGGAAGAGCG ACAGAAAAAA AACTCTTTAA CCTGGGAATT
AAGACGATAG GGGAGCTAGC ACAAACAGAT GTAAAAATAT TGAAAGCTCA TTTTGGGAAG
TATGGGGAAG TACTTTACCA GTTTTCACAT GGAATCGATG AATCCCCCCT TTTTGTTCCT
TTAGAAGCAA ATAAGGGGTA TGGCAATTCC GTAACGACAC CTTACGATAT TGTTACGATG
GAGCATGCAA ATCTCGTTTT GCTATCGTTA AGCGAAACAG TATGTACGAG ACTTCGAATG
GATGGCGTAA AAGGGCAATG TGTGTCTGTT TCGGTTACAA CAGATACTTT TCAGAGGGCC
TCTCATCAAG GGATGCTTTT TTCGGCTTCT AATACGACGA TGGAGGTATA CCGTTTTGCT
TGCCGTTTAT TTAAGAATCT ATGGGATGGA AGGACGCCAA TTAGACAAAT GGGCGTGCAC
ACAAGTAGGA TTACCAAGGA GAGTACGATG CAGTATAACC TATTTGATTG GGATCGCTAT
GAGAAATTGA GTAAATTGGA TGAAACAATA GATTCTATAC GAAAGAGGTA CGGAGACGAT
TCTGTGATGA GAGCTTGTTT TTTAAATACA AGTACTTATC ATATGCATGG AGGAATATCA
AAAGAAAAGA AAACGGGAAT TACAAAGCCG CTGCGGTAG
 
Protein sequence
MGRFVHILSG SVAMQEKIFF HIDVNSAFLS WEAAYRLHVL GESVDLREIP SVIGGDKENR 
HGIVLAKSTS AKKLKIHTGE ALGAAVQKCP NLVIIPPNYQ RYVKASKSLM EILHRFSPKV
EQYSIDEAFV DMSGSELLYG GPVIVANNLK DLIEEELKFT VNIGVSSNKL LAKMAGELKK
PNLVHTMFPE EIPKKMWPLP VGELFFVGRA TEKKLFNLGI KTIGELAQTD VKILKAHFGK
YGEVLYQFSH GIDESPLFVP LEANKGYGNS VTTPYDIVTM EHANLVLLSL SETVCTRLRM
DGVKGQCVSV SVTTDTFQRA SHQGMLFSAS NTTMEVYRFA CRLFKNLWDG RTPIRQMGVH
TSRITKESTM QYNLFDWDRY EKLSKLDETI DSIRKRYGDD SVMRACFLNT STYHMHGGIS
KEKKTGITKP LR