Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cphy_1074 |
Symbol | |
ID | 5741909 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium phytofermentans ISDg |
Kingdom | Bacteria |
Replicon accession | NC_010001 |
Strand | + |
Start bp | 1358682 |
End bp | 1360004 |
Gene Length | 1323 bp |
Protein Length | 440 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641292179 |
Product | extracellular solute-binding protein |
Protein accession | YP_001558191 |
Protein GI | 160879223 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00779107 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGAA AATTAATTAG TATCTTGTTA TGTTTAAGCT TATGTATGGC TTTTTTTGCA GGTTGTAGTA AGCAAGACTC CAAGGGAAAA GTTTATTATC TAAGCTTTAA ACCGGAATCC GATGAGATTT GGAAAGAGAT TGCTAAGGCT TATACCGAAG AAACCGGTGT TGAAGTTGTT GTGAAAACAG CTGCAGCTGG TACTTATGAG CAAACATTAA AAGCAGAGAT TTCTAAGAGT AATGCACCTA CCTTATTTCA AATCAATGGA CCTGTTGGAT ATCAGTCTTG GAAGGATTAT TGTCTTGATT TACAAGGTAC TGATCTTTAT AGCTGGCTCC TAGATAAAAG TTTAGCTGTA TCTTCTGGTG ATGGTGTTTA TGGTATTCCA TATGTGGTTG AGGGTTATGG AATTATTTAT AATGATGCAA TAATGCAAAA ATACTTTGCT TTGTCTAATA AGGCAGTTAC CATCTCTTCT GCAGCTGAAA TTACAAACTT TAATACACTT AAGGCAGTAG TTGAGGATAT GACTGCTAAG AAGGCAGAAC TTGGAATTGA AGGTGTCTTT GCTTCCACTT CTTTTGCTCC AGGTGAGGAT TGGAGATGGC AGACACATTT AGCCAATTTA CCAATCTATT ATGAATTTTT AGATAAGAAG GTGTCAGATG CTGATACTAT CGATTTTACA TACTCTGATA ATTATAAGAA TGTTTTTGAT TTATACATAA ATAATTCCTG TACGGATAAG GGAGTGCTTG GAAGTAAAAG TGTTGCAGAT TCTATGGCTG AATTTGCACT TGGAAAAGTT GCAATGGTCC AGAATGGTAA TTGGGGTTGG GGACAGATTA ATGGTGTAGA AGGTAACACA GTAAAAGAAA CGGATGTAAA ATTTCTACCG ATCTATACTG GAGTAGCAGG GGAAGAAAAG CAAGGATTAT GTACTGGAAC AGAGAACTTC TTCTGTATTA ACAGCAAAAC CTCAAAAGCT AATCAAGAAG CATCCATTGC CTTTATTGAA TGGCTTTATA ACTCTGAAAA AGGAAAAGAC TATGTTACAA ATAAGTTAGG TTTTATCGCT CCATTCTCAA CTTTTAAGGA AAATGAAAAA CCAACAGATC CATTAGCAAA AGAAGTACTC CGCTATATGT CTGACACAAG TAAGGTATCC GTTGCTTGGA ATTTTACTGC ATTTCCAAGT CAGGCATTTA AGGACTATTT TGGCTCCAGC TTATTACAAT ATGCTCAGGA TAAAGATACT TGGCAGGATG TAAAGGATTC CGTAATTAAT TACTGGAAAT TAGAAAAAGA GGCTACAAAG TAA
|
Protein sequence | MKRKLISILL CLSLCMAFFA GCSKQDSKGK VYYLSFKPES DEIWKEIAKA YTEETGVEVV VKTAAAGTYE QTLKAEISKS NAPTLFQING PVGYQSWKDY CLDLQGTDLY SWLLDKSLAV SSGDGVYGIP YVVEGYGIIY NDAIMQKYFA LSNKAVTISS AAEITNFNTL KAVVEDMTAK KAELGIEGVF ASTSFAPGED WRWQTHLANL PIYYEFLDKK VSDADTIDFT YSDNYKNVFD LYINNSCTDK GVLGSKSVAD SMAEFALGKV AMVQNGNWGW GQINGVEGNT VKETDVKFLP IYTGVAGEEK QGLCTGTENF FCINSKTSKA NQEASIAFIE WLYNSEKGKD YVTNKLGFIA PFSTFKENEK PTDPLAKEVL RYMSDTSKVS VAWNFTAFPS QAFKDYFGSS LLQYAQDKDT WQDVKDSVIN YWKLEKEATK
|
| |