Gene Cphy_0812 details

Gene Information

Locus tag: Cphy_0812
Symbol:
ID: 5745292
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 1038064
End bp: 1040445
Gene Length: 2382 bp
Protein Length: 793 aa
Translation table: 11
GC content: 36%
IMG OID: 641291926
Product: extracellular solute-binding protein
Protein accession: YP_001557938
Protein GI: 160878970
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
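
The coordinate and length fields above are consistent with one another, which the following minimal Python sketch checks. It assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1038064, 1040445)
print(length_bp, protein_length(length_bp))  # 2382 793
```

Under these assumptions the computed values match the Gene Length and Protein Length fields.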


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000378939
Plasmid hitchhiking: No
Plasmid clonability: unclonable

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACCCGAA AAAAGGAGAG AGAGACTATG AGAAAAAACA AGTTTAAAAA AGCATTGTCT 
GTTTTTCTTG TTTTATCCAT GTGTGTTGCT TTAGCAGCTT GTGGAAAGAA AAATAGTGGC
AATCAGTCAG GGGAGAACCC GAGTGAAGGA AATAATTCAG GTAATTCTAG TGATAAACCT
TTAGTAGTTG GTTCCTTACA ATTTAGTGAA AAATTTAGTC CATTTTTTGC TACTACTGGT
TACGATAGGG AAATCGCTGA CTTAACACAA GTTCAGATGT TAACTACAGA TAGAACTGGT
GGTTTAATTT TAAATTCAAT TGAAGGTGAG ACTGTTCCTT ACAATGGTAC AGATTATCTT
TATACAGGTA TAAGTGATAT TGCAGTAAGC CGTGATGAGG CTGCTGATAC CACTACTTAT
AATATTAAGA TTCGTGATGA CATTAAGTTC AGCGATGGCA AGGTAATGGA CGCTGATGAT
ATTATCTTCT CTTATTATGT ATTATCTGAT ACTAGTTATG ATGGAAGTTC TACATTATAC
TCTACACCAA TTGTTGGTAT GTTAAATTAC CGTAAGAACA ATTCTAATGC AGAGAGCACT
GTAGTAACTG AGGATGAAAT TGCTAAAGAG CTTGCTTCTT TAAGTGATGC ATCAAAAGAA
GCAATCAACA AAGAAATAAT TGCTCCAACA TTAACTTCAG AACTTGACTG GGTTAAAGGT
CTCTATGGCA AAGATTCATA CAAAGAGTAC ACAGAAAAAT ACCCAGTAAC AAAAGATTTA
TTTGCTCACT TCTATAGTGT AGATGAGAAT TATGATGCTT CTAAGGTAGA AGACGAAGCT
AAAGTTTTAG AAGAAATTAC AGCAGCTTAC GGTGCTAATT ATCAAGCACT TGGCGAAGCA
TATGCAGGAG ATGAAACTTA CTTTGCTGAC CAGGTTAAGG AAGTTGTATC TAATGTATTA
TTAGAAGAGA AACTTTCAAA AGGCGGAGAT GAGGTTCCTA ATATTGCTGG TATTAAGAAA
ATCAGCCAGA CTGAGGTAGA AGTTACAACC AATGGTTTTG ACGCTTCTGC TATCTACAAC
ATTTGTGGTA TTACTATTTC TCCAATGCAC TACTATGGTG ATGAGGCTCA GTATGATTAT
GATAATAATA ACTTTGGTTT TACTAGAGGT GATTTATCTA TTGTAAAAGC AAAGACTACA
AAGCCTTTGG GAGCAGGTCC TTATAAATTC GTAAAGTATG AAAACAAAGT TGTTTACCTT
GAAGCAAATG AATTTTACTA CAAGGGTGAA CCTAAGACAA AGTACCTTCA GTATAAAGAA
ACAACAGAAG CTGATAAGAT TTCTGGTGTT GGTACAGGTA CAATTGATAT TGCAGATCCA
TCCGGTAGCG TATCTGCTTT CGATGAAATT GCAAACTACA ATTCAAACAA AGAATTATCC
GGTGATAAGA TTATTACAAG TACTGTAGAT AATCTTGGTT ATGGATACAT CGGTTTTAAC
GCCGATGCTA TTAAAGTTGG TAATGATCCA GCTTCTGAAG AATCTAAGAA TTTAAGAAAA
GCGATTGCTA CTATATTAGC AGTATACCGT GATGTAGCTA TTGATAGTTA CTATGGTGAA
GTTGCTAGCG TTATTAACTA CCCTATCTCT AATACATCTT GGGCAGCACC ACAGAAATCT
GATGAGGATT ACAAAGTTGC TTACTCACAG GGAGTAGATG GTAACGATAT CTATACTGCA
GATATGACAG CTGACCAGAA ATACGAAGCA GCAACAAAGG CTGCAATAGA ATTTTTCAAG
GCAGCAGGAT ATACTTTTGA TGATGCAACA GGTAAGTTTA CAAAGGCTCC AGAGGGTGCT
AAATTAGAGT ATGAAATTTT AATTGGTGGA GATGGTAAAG GTGATCACCC ATCCTTCGCG
ATTCTTGCTA AGGCAAAGGA AACTCTTGCT ACAATCGGTA TCACATTAAC AATTAATGAT
CCATCTGATT CTAATGTAAT GTGGGATAAA ATAGAGGCTG GAACTCAAGA AATGTTTGTA
GCAGCTTGGA GTGCAACAAT AGATCCAGAT ATGTACCAGA CTAAGTACAG TACAAACATT
GTCGGTAAAG GTGGTTCTGA TTCTAATCAC TATCATATTG CTGATCCTCA GTTAGACCAG
TTAATTATGG ATGCTAGAAA GAGTGCTGAC CAAGCTTACA GAAAAGCGAC ATATAAGACT
TGTCTTGACA TTATGCTTGA TTGGGGTGTC GAAGTTCCTG TATACCAGAG ACAGAACTGT
ATTATATTTG CTAAAGATCG TGTAAACACT GATACAGTTA CTCCTGATAT CACAACCTTC
TGGAAGTGGA CTAACGATAT TGAAAAGATT GAAATGAAAT AA
 
Protein sequence
MTRKKERETM RKNKFKKALS VFLVLSMCVA LAACGKKNSG NQSGENPSEG NNSGNSSDKP 
LVVGSLQFSE KFSPFFATTG YDREIADLTQ VQMLTTDRTG GLILNSIEGE TVPYNGTDYL
YTGISDIAVS RDEAADTTTY NIKIRDDIKF SDGKVMDADD IIFSYYVLSD TSYDGSSTLY
STPIVGMLNY RKNNSNAEST VVTEDEIAKE LASLSDASKE AINKEIIAPT LTSELDWVKG
LYGKDSYKEY TEKYPVTKDL FAHFYSVDEN YDASKVEDEA KVLEEITAAY GANYQALGEA
YAGDETYFAD QVKEVVSNVL LEEKLSKGGD EVPNIAGIKK ISQTEVEVTT NGFDASAIYN
ICGITISPMH YYGDEAQYDY DNNNFGFTRG DLSIVKAKTT KPLGAGPYKF VKYENKVVYL
EANEFYYKGE PKTKYLQYKE TTEADKISGV GTGTIDIADP SGSVSAFDEI ANYNSNKELS
GDKIITSTVD NLGYGYIGFN ADAIKVGNDP ASEESKNLRK AIATILAVYR DVAIDSYYGE
VASVINYPIS NTSWAAPQKS DEDYKVAYSQ GVDGNDIYTA DMTADQKYEA ATKAAIEFFK
AAGYTFDDAT GKFTKAPEGA KLEYEILIGG DGKGDHPSFA ILAKAKETLA TIGITLTIND
PSDSNVMWDK IEAGTQEMFV AAWSATIDPD MYQTKYSTNI VGKGGSDSNH YHIADPQLDQ
LIMDARKSAD QAYRKATYKT CLDIMLDWGV EVPVYQRQNC IIFAKDRVNT DTVTPDITTF
WKWTNDIEKI EMK
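
The relationship between the two sequences above can be illustrated with a minimal Python sketch that translates the first and last lines of the gene sequence. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard genetic code and differs in the permitted start codons; because this gene begins with ATG, no special handling of the start codon is attempted here. The names `CODON_TABLE` and `translate` are illustrative and not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces and line breaks used for display
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]  # raises KeyError on non-TCAG characters
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


first_line = "ATGACCCGAA AAAAGGAGAG AGAGACTATG AGAAAAAACA AGTTTAAAAA AGCATTGTCT"
last_line = "TGGAAGTGGA CTAACGATAT TGAAAAGATT GAAATGAAAT AA"
print(translate(first_line))  # MTRKKERETMRKNKFKKALS
print(translate(last_line))   # WKWTNDIEKIEMK
```

Each full line of the gene sequence holds 60 bases, so every line starts in frame and can be translated on its own; the two results correspond to the first 20 and last 13 residues of the protein sequence.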