Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_2308 |
Symbol | |
ID | 5745367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | - |
Start bp | 2843299 |
End bp | 2844621 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641293398 |
Product | extracellular solute-binding protein |
Protein accession | YP_001559408 |
Protein GI | 160880440 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2182] Maltose-binding periplasmic proteins/domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000253431 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAAAA AAGTTTTGGC ATTATTGATT ACAACAACAA TGGTGTTTTC AATGGTGGGC TGTGGTAGCA AGGGAAAAGA TGTTAGTAAT TCAACAACAC CATCACCGAC GCCACAAAGT GAAAAGAAAC AAGCAGAGAG CAATAGCAAC CAAGGTGGAG AAGAAAAAAA GATATCTGGT GACCTTCTTG TATGGTTGGA TAACGATGAC TGGGCAGATG CTGTTATCGA AGCATTTAAT GCAAAGTATC CGGATGTAAC TATTGAATAC CAGAACGTAG GCAATGTTGA TACAAGGGGT AAAGTTTCCT TAGACGGTCC TGCAGGCATT GGCCCGGATG TTTTCTTGAT GCCTCATGAT CATATGGGAA TTGCCATTGA AGATGGTTTG TGTGAGCCTA TGACAGATGA GCTTCAAAAA AAATATGAAA ATAACATTTT GGATGCGGCA TTAGAAACTT GTACGGCGGA TGGAAAGGTA TACGGAGTGC CGATTTCAAC TGAGAACATC GCTTTATTCT ATAATAAGGA TTTATACGGA GAAAACCCTC CATCATCTTT TGAAGAAATC ATTGAATTTG CAAAAGGATA TAATGATTTT GCCGCAGGAA AATATACTAT GGCATGGCAA GTAGATGATG CTTATCATAA CTATTTATTC TTAACAGCAT TTGGTATGCA ATTATTTGGA CCAGATATGA GGGACTATAA AACGCCAGGC TGGGATACAC CACAGGTTAC AGAGGCAATT GATTTTTACC GTAGCCTTCG TAAACAACTT TTTGATGTAA ACGTTGTAGA TGCAAGTTGG GACGCAACAG TAGCAGCATT CCAGAGAGGT GAAGTGCCTC TTACTATTTC AGGACCTTGG GCAATTTCAG ATGCTTTGAC AAATGGAGTG AATTTCGGCG TAACAAAACT TCCAACGATC AAGGGCGTAC AACCTAGATG TTTTTCAGGA AATATTATTG CTTCTGTATC TAGTTATGCG AAAAATAAAG AAGCAGCATA TGCATTCGTT GATTTTCTTG CAGGCGAAGA GGGTGCAACA ATTATGTATA AGGTAACAGG CAAGATGACA GCACTGAAAG ACATCTCCAA TATTGCAGGC TTAAAAGAAG ACGTATATTT AAAGGGAATT CAAGAGCAAT CCCCATATGC TGATCCTATG CCGATTATAC CAGAAATGTC ACAGGCTTGG GATGCAATCA AAAACCTGTT TACATTCACC TGGGATAATA CGCTTACTTC TAAAGAAGCA CAGGATAAAG CAATGGATAC ATATAAGACA GCATTACAAG CAGCTGGAAA GACTCTTGAC TAA
|
Protein sequence | MKKKVLALLI TTTMVFSMVG CGSKGKDVSN STTPSPTPQS EKKQAESNSN QGGEEKKISG DLLVWLDNDD WADAVIEAFN AKYPDVTIEY QNVGNVDTRG KVSLDGPAGI GPDVFLMPHD HMGIAIEDGL CEPMTDELQK KYENNILDAA LETCTADGKV YGVPISTENI ALFYNKDLYG ENPPSSFEEI IEFAKGYNDF AAGKYTMAWQ VDDAYHNYLF LTAFGMQLFG PDMRDYKTPG WDTPQVTEAI DFYRSLRKQL FDVNVVDASW DATVAAFQRG EVPLTISGPW AISDALTNGV NFGVTKLPTI KGVQPRCFSG NIIASVSSYA KNKEAAYAFV DFLAGEEGAT IMYKVTGKMT ALKDISNIAG LKEDVYLKGI QEQSPYADPM PIIPEMSQAW DAIKNLFTFT WDNTLTSKEA QDKAMDTYKT ALQAAGKTLD
|
| |