Gene Cphy_0862 details


Gene Information

Locus tag: Cphy_0862
Symbol:
ID: 5741734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium phytofermentans ISDg
Kingdom: Bacteria
Replicon accession: NC_010001
Strand:
Start bp: 1101768
End bp: 1103474
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 39%
IMG OID: 641291975
Product: extracellular solute-binding protein
Protein accession: YP_001557987
Protein GI: 160879019
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
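
The start and end coordinates, gene length, and protein length listed above agree with one another under inclusive 1-based coordinates. The arithmetic can be sketched as follows. The variable names are illustrative and not part of the record, and subtracting one codon assumes the annotated CDS includes its stop codon (the gene sequence in this record ends in TAA).

```python
# Values copied from the Gene Information record.
start_bp = 1101768
end_bp = 1103474

# Inclusive 1-based coordinates: length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes a terminal stop codon, which is not
# counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1707 568
```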


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATGG GTAGCAAAAA ATGTATTGCG CTTCTACTTT GTTTCATCAT GCTGTTTTCT 
CTCGCTGCTT GCGGTAAAGG GAAAACAACC GATACAACAG TAGGGGCTAC AACGGGACCA
ACAAAAGCAG TAGTACCTAC AGATGGAACG AAAGAAGGAA CGAAAGAAGG AACAAAGGAT
AATAACAATC AGGGAGAAGT AGTTCCTGGT TGGCAGACAA ACGCATCCGA TAAGGTTGAT
TTAACATGGT ACATAAACTT TTCTTGGTTT ACTACTCCTT GGGGTGAGAA CTTAATATCC
AAGACGATTA CAGAAGAAAC TGGCTGTAAC ATTAATTTTG TCGTTCCAGC AGGTAACGAA
GCTGAGAAAC TAAATTCTTT GATTGCATCA GATTCTTTAC CAGATTTATT AACGATTGGT
TGGTGGGAAG GCCAAGTAAA TCAGATGATC GAAGATGGAA TGGTTTATGC TTTAGACGAA
CTAGCAACGA AGTATGACCC ATACTGGTTT GAAGTAGCAG ATAAAGGTCG TGTTGGCTGG
TTTACAAAAG AAGATGGACA CGTTTACGGA TATCCAAACT CTTCTTTCTC TCCACAAGAT
TATGAAAAAT ATGATAACAT CTCTTCCAAT CAGACCTTCC TTGTAAGAAA GGATATGTAT
GAAGCAATTG GAAGTCCAGA TATGACAACA CCGGAAGGTT TTATTGCAGC AGTAAAAGCA
GCAGCGGCTA AGTACCCAGA AGTGAATGGA CAGCCAATCA TTCCAGTAGG TGCACATGAA
TTTATTTCAA CTGGTAATAA TTCTTTTGAC CTGTATTTAT CTAACTTCTT GGCAACCCCT
TATGAAAAAG ATGGTAAGTA CTACGATAGA TATAGCGATC CTGAATTAGT TCGTTGGTTA
AAAGCTTTTC GCGAATTAGG TGCAGAAGGA TATCTAAAAG AAGATATCTT CGTTGACAAG
AGAGCTCAAA TGGAAGAAAA GATTGCTCAG GGACGTTACT TCTGTATGTT ATATCAGAGA
ACTGATTTAG CAGATCAAGA GAAGGTATTA TACAAGAATG ATCCAAACAG TATTTATATC
GCAATCGATG GACCAAAGAA TTCCAAAGGA GATGCTTACA CCTTACCAGG AACCGGTATC
AATGGTTGGT GTGTAACTAT GATATCTAAG AAGTGTCAGC GTCCAGATCG TGCGATCCAA
TTATGCTCTT ACTTAATGAG TGAGCATGGG CAGCATATGA CTTGGTTAGG TGTAGAAGGT
GTTACTTGGG ATTATGTAAA TGGTAAAGAA ACAATGAAAC CAGAAGTGAA AGAAATTTTA
ACAACTGATC GTAGTGCTTA CGATAAACAG TATGGTGCAG ATTCCTGTTA CTGGATGTTC
CAAGACAATG CAATGTCATT AAAATGGGCG GTGGAAACAC CAGAACCTCT AGGACAGATG
GAACGCTGGA CCTTCCCATA TGTTATATCA GTATCCCAGT ATGATGTTAG CTTAGCAGCA
GATTCTGATG AATTCGATAT TCAGAGTAAT GTAGATAATG AATGGGGTGT TGTATTACCT
CGACTATTAT TAGCGAAGAC AGAAGCAGAC TTTGATACTA TCTGGAATGC ATTTATTCAG
AAGAAAAAAG ATTTCGGTTT TGATAAAGTT CTAGCAACGA AGACTGAGAT GATGAAGGAA
GCAAAAGTGA AATTAGGAGT TAACTAA
 
Protein sequence
MKMGSKKCIA LLLCFIMLFS LAACGKGKTT DTTVGATTGP TKAVVPTDGT KEGTKEGTKD 
NNNQGEVVPG WQTNASDKVD LTWYINFSWF TTPWGENLIS KTITEETGCN INFVVPAGNE
AEKLNSLIAS DSLPDLLTIG WWEGQVNQMI EDGMVYALDE LATKYDPYWF EVADKGRVGW
FTKEDGHVYG YPNSSFSPQD YEKYDNISSN QTFLVRKDMY EAIGSPDMTT PEGFIAAVKA
AAAKYPEVNG QPIIPVGAHE FISTGNNSFD LYLSNFLATP YEKDGKYYDR YSDPELVRWL
KAFRELGAEG YLKEDIFVDK RAQMEEKIAQ GRYFCMLYQR TDLADQEKVL YKNDPNSIYI
AIDGPKNSKG DAYTLPGTGI NGWCVTMISK KCQRPDRAIQ LCSYLMSEHG QHMTWLGVEG
VTWDYVNGKE TMKPEVKEIL TTDRSAYDKQ YGADSCYWMF QDNAMSLKWA VETPEPLGQM
ERWTFPYVIS VSQYDVSLAA DSDEFDIQSN VDNEWGVVLP RLLLAKTEAD FDTIWNAFIQ
KKKDFGFDKV LATKTEMMKE AKVKLGVN
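
The sequences in this record are printed in blocks of ten characters. Below is a minimal sketch, in plain Python with no external libraries, of how such a listing could be cleaned up, checked for GC content, and translated. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and only the first 30 bases of the gene are pasted in for brevity. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in the permitted start codons), so a standard codon table is used here.

```python
from itertools import product

# Standard codon table in TCAG order; amino-acid assignments are the
# same for translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def clean(block_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()

def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * sum(dna.count(b) for b in "GC") / len(dna)

def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three blocks of the gene sequence listed in this record.
fragment = clean("ATGAAAATGG GTAGCAAAAA ATGTATTGCG")
print(translate(fragment))  # MKMGSKKCIA
print(gc_percent(fragment))
```

The translated fragment matches the first ten residues of the protein sequence in the record. Pasting the full gene sequence into `clean` would allow the same check against the whole protein and against the listed GC content.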