Gene Cphy_0697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_0697 
Symbol 
ID5743813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp902506 
End bp904050 
Gene Length1545 bp 
Protein Length514 aa 
Translation table11 
GC content37% 
IMG OID641291809 
Productextracellular solute-binding protein 
Protein accessionYP_001557823 
Protein GI160878855 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000787338 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATGA AAAAAGTTAT GCGCAGTATC TTAAGCCTCA TTATGGTCAG CGTGCTCCTA 
GCGGGATGCG GCAAAGGGGC GAAAACAAAT TCATCAGGGA ATAAGTCTGA GGGGGAAGTC
GACTTATCAG AACATGTAAC ACTTACTATG TATCTGTATG GTTCCGCTGG TGTAGCAAAT
GAGGATATTT TAGCAGAAAT TAATCGTAAA CTGACGGAAG ATATTAATAC TTCCATCGAA
ATTAAGTACA TAGATTGGGG CGATATTGGA ACCAAGTATC CGCTAATCTG GGCATCCAAT
GAAGCTTTTG ATATGGCTTA TGCTTCAATC AATACAGTAG TACCGTTTTA TACATTGGCA
AAACAAGGAT CTTTATATGA TATCACTAAT ATCATAGACA AATATGCTCC AACATTAAAA
ACAAGAGTAC CAGAAACAAG CTGGGCAGCA ACAACCGCCG AAGGTAAAAT CTACGGTGTT
CCAACCCTTG GAGCAGGCTT TAACTGTACT GGTTTTGTAT ATAACAAAGC GAATTGTGAA
AAATGGGGTA TTAAAGAAGT TACCGATTTA GACTCTATGA TTGCATATTG TGATGCCTCT
GTTGCTCATG GAATCTATCC TATCAATGGT AATGCAGAAG TTTCTATGGA CTTTTATAAA
ATGCTAGTAG ATACTACCGG AAACTGGGTA CCTGCTCCTG GTATTTCTAC CAGTGAAATG
TATTTTGTAA CTCGTGATTA TCAAGACTAT AAAGATGTGA TCCACCCAGC ATTTACAGAT
GAATTTGCAC AGTATGTGAC AATGCTTGAT GAGTGGGAAC ATAAAGGGTA TTGGCCAAAG
GATATCCTAT CTTCTTCCAC TGGTGATAAG GAAATGTATA AGAATGGTCA GTCTTCCTCT
TATATCACAC ACATGGGTGA TTGGACAGGA AACTATACCG GTATCCATGG TAATTTACCA
GATCAGGATA TGGATGCATG GTACTTTGCA GAAGATAATA ACAAGGTAAT GCAAAGTTCA
CCGGCTCAGG ATATAACTGT TGTTAATGCA AATTCTAAAT ATCCTGAAAG ATGCGTTATG
GTAATTGAGA AGTTCCTAAC TGATAAATCT TATTATAGTT TATTACAGTA TGGTATTGAA
GGTCGCCAAT ATGAAATTAA AGATGGTTTT CTTGAAAAAC CAGCAAATTT CAATGATGCT
GTAGACGCAG GTGGATTTTC AGCTTGGGCA TTTAGTAACT CAGAATTTAA ACTTCCTCGT
AGAACAGAGC ATCCTAGCCG TTACGAAAAA ACAAAAGAGT GGAGTCAGAA TTACCTTAGT
GACCCTTATA CAGGATTTAG CTTTGATTCC TCCAAAGTTA GTACGGAACT TTCAGCGATA
TCTAACGTTA ACTCAACCCT TGGTATACAG CTTATGTTAG GTAAGACAGA AAATGTGGCA
GCTGCTATTG AGGAATATAG AAACCAGCTA AAACAAGCAG GTATTGATAA AGTTTTAGAC
GAATTAAAAA ATCAATTAGC ATCCTTTACG CCTATTATTA AATAA
 
Protein sequence
MKMKKVMRSI LSLIMVSVLL AGCGKGAKTN SSGNKSEGEV DLSEHVTLTM YLYGSAGVAN 
EDILAEINRK LTEDINTSIE IKYIDWGDIG TKYPLIWASN EAFDMAYASI NTVVPFYTLA
KQGSLYDITN IIDKYAPTLK TRVPETSWAA TTAEGKIYGV PTLGAGFNCT GFVYNKANCE
KWGIKEVTDL DSMIAYCDAS VAHGIYPING NAEVSMDFYK MLVDTTGNWV PAPGISTSEM
YFVTRDYQDY KDVIHPAFTD EFAQYVTMLD EWEHKGYWPK DILSSSTGDK EMYKNGQSSS
YITHMGDWTG NYTGIHGNLP DQDMDAWYFA EDNNKVMQSS PAQDITVVNA NSKYPERCVM
VIEKFLTDKS YYSLLQYGIE GRQYEIKDGF LEKPANFNDA VDAGGFSAWA FSNSEFKLPR
RTEHPSRYEK TKEWSQNYLS DPYTGFSFDS SKVSTELSAI SNVNSTLGIQ LMLGKTENVA
AAIEEYRNQL KQAGIDKVLD ELKNQLASFT PIIK