Gene Cphy_2347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphy_2347 
Symbol 
ID5745406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium phytofermentans ISDg 
KingdomBacteria 
Replicon accessionNC_010001 
Strand
Start bp2896060 
End bp2897355 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content36% 
IMG OID641293437 
Productextracellular solute-binding protein 
Protein accessionYP_001559447 
Protein GI160880479 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2182] Maltose-binding periplasmic proteins/domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGA AACTTTTTGG ATTATTTATG GCAACCACAT TAGTAGTTTC CTTAGTTGGT 
TGTGGTAAGA AAGCAGAAAA TCCATCAACT GATAACGGAA AAACAGAAGC AACACAGACT
CCTGGTGCAA CGGAAGCACC TGCAAAAGCA GAGGATGTTA CTTTAAAAGT TTGGGCACCT
GAAAATCAGA TTAAAGATGG AACAATGGAT TCTATGACAA AATCCTTCCA GGAATTACAC
CCAGAATGGA ACATTAAATT TACTATTGAA ACACAGGGTG AAGATACAGC AAAAGATGAA
ATCTTAAAAG ATGTTGGAGC TGCAGGTGAC GTATTCTTCT TTGCTAACGA TCAATTAAAT
GAGCTTGTAA ATGCAGGTGC AATTGCAAAG CTTGGCGGAT CTACAGAAGA AATGGTTAAG
ACAACTATGG CAGAATCAGT TGTTAATACA GTAAAAGTAA ATGATGCTAT TTATGCAATT
CCTTTTACAC ATAATACATT CTTTATGTAC TATGATAAGT CACTTTTAAA CGAAAATGAT
ATTAAATCCA TTGAAGGCAT TATGGCAAAA GAAACTCCTT CTAATGTATA CAATTTCTAT
TTTGAATCAG CAGGTGGCTG GAAATTAGGT GCTTGGTACT ATGGTGCAGG TTTAACAATC
TATGGAGAAA ACCAGACTGA TTTTGCTGCA GGAGCAAATT GGAACAATGA AACAGGCGTT
GCTGTAACAA ATTACTTAAT TGACTTAATT AAGAATCCTA AAGCAGCTTT TGATGGTGAA
ATTTCCTTAT CCGAATTAGC AGGAGATCAT AGAATCGGTG CTTGGTTTGA CGGTTCTTGG
AACTATAAAT TATATAAAGA TGCTTTAGGC GATGACTTAG GTTTAGCAGT AATTCCTACA
TTTAATCCAG ATGGCAATGA TTATCAGTTA AAAGGCTTCT ACGGTTCAAA AGCAATCGGT
GTTAACTCTC ATGCAGCTAA TCCTGCTGTA GCAGTAGCGT TTGCTGCATA CCTTGGAAGT
GAAGAAATGC AAGTACAACG TTTTGAAGAA ACTGGTCAAG TTCCTACAAA CCTTAAAGCT
GGTGAATCAG CAGCTGTTCA GGCAGACGAA GTAGCTAAAG TTATCGTTGA AGAAGCTAAT
GTTGCATCTA TAATGCAGCC TACATCCTCA GAATTCAGTT CAAGATACTG GGCAAATGCA
GGTGGTATTG CTACTGAAAT CAGAAGCGGT GCGTTAAATA AAGATAATGT ACAACAAAAA
TTAGATACTT TTGTTTCCTC ATTAAAAGTA GAATAA
 
Protein sequence
MRKKLFGLFM ATTLVVSLVG CGKKAENPST DNGKTEATQT PGATEAPAKA EDVTLKVWAP 
ENQIKDGTMD SMTKSFQELH PEWNIKFTIE TQGEDTAKDE ILKDVGAAGD VFFFANDQLN
ELVNAGAIAK LGGSTEEMVK TTMAESVVNT VKVNDAIYAI PFTHNTFFMY YDKSLLNEND
IKSIEGIMAK ETPSNVYNFY FESAGGWKLG AWYYGAGLTI YGENQTDFAA GANWNNETGV
AVTNYLIDLI KNPKAAFDGE ISLSELAGDH RIGAWFDGSW NYKLYKDALG DDLGLAVIPT
FNPDGNDYQL KGFYGSKAIG VNSHAANPAV AVAFAAYLGS EEMQVQRFEE TGQVPTNLKA
GESAAVQADE VAKVIVEEAN VASIMQPTSS EFSSRYWANA GGIATEIRSG ALNKDNVQQK
LDTFVSSLKV E