Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1077 |
Symbol | |
ID | 4201672 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1229188 |
End bp | 1230621 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 29% |
IMG OID | 638081958 |
Product | extracellular solute-binding protein |
Protein accession | YP_695523 |
Protein GI | 110800997 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.182725 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAAAAA GAAAAAAAGC ATTATCACTG ATTTTAAGTG GTATTGTATG TTCATCTTTA ATACTTACAG GCTGTGGAAA TAGCAAAGAA GAGGAAAGTA AGGAAACTGT TAATTTAACT TGGTATGTTA TAGGTGATGA GCCAGCTGAT AATGACATTG TTGAAGAAGA AGTAAATAAA TATTTAAAAG ATAAAATAAA TGCTACAGTA GATATAAAAC ATATACCTTT TGGGGATTAT ACTAAGAAAA TGAGTGTTAT TAGTAATTCA GGGGAACCTT ATGATCTTGC CTTTACATGT TCTTGGGCAT TTCCTTATTT AGAATATGCT AGAAAAGGAG CCTTTTTAGA GTTAAATGAT TTATTAGATA CAGAAGGAGC ACCATTAAAG AAAGAAATAA ACAAAGAGTT ATGGAAGGGT GCAGAGATAG ATGGTAAAAT ATATGCTGTT CCAAATCAAA AAGAAATAGC ATTAGCACCT ATGTGGGTAT TTGATAAAGA GCTTGTTGAA AAGTATAATA TTCCTTATGA AAATATACAT TCAGTAAACG ATTTAGAACC TTGGTTAAAA TTAGTAAAAG AAAAAGAACC GGATTTTATT CCATTTTATA CTCAAGGAGA TTCAATTCCT TTAGACTTTG ATGATATAGT AAATCCATTA GGTATATTCT ATAATGATAA GAATTTGACA GTAACAAATA AATTTGAATC AAAAGAAATG AAGGATATGT TATTAAAATT AAGAGAATAT TATGAAGCTG GATACATAAA TCAAGATGCA GCAGTAACTG ATATGAAACC AGAAGTAAAA AGATTTGTAT GGAAAGCAGA TGGACAACCA TATGCTGAAA ATATATGGGG AAAATCTTTA GGTAGGGAAG TAGTAACTTC ATCAATAATA CCACCATATA TAACTAATAA TTCAACAACT GGTGCAATGA CTGCAATTTC TGCTAACTCT AAACATCCTA AGAAGGCAAT GGAGTTATTG ACTTTAGTAA ATACTGATAC AAATTTAAGA AACTTATTGA TGTTTGGAAT AGAGGGAAAA CATTATGAAA AAATAAATGA CAAGCAAATA AAGAAATTTG ATGGAAAGAA ATATGATGTT GTTAGCTGGG CATATGGAAA CTTACTTGGT ACTTATGTTT CAGAAAATGA CCCTATAGAT AAGTGGGAGG CTTTTGAAAA ATTCAATAAT TCAGCTAAGG TTTCTCCAAT ATTAGGATTT AAATTTAATT CGGAAAACGT TTCAAATCAA ATTTCTGCTA TAAATAATGT TTTACAGGAA TTTGAAAGAG CATTATATAC AGGTTCAATA GATCCTGAAG TTGGACTAAA TGATTTAAAT AAGAAATTAA ATGATTCAGG TATAAATGAA GTTAAAGATG AAATACAAAA ACAATTAAAT GAATGGAAAG CAAAAAATAA TTAG
|
Protein sequence | MIKRKKALSL ILSGIVCSSL ILTGCGNSKE EESKETVNLT WYVIGDEPAD NDIVEEEVNK YLKDKINATV DIKHIPFGDY TKKMSVISNS GEPYDLAFTC SWAFPYLEYA RKGAFLELND LLDTEGAPLK KEINKELWKG AEIDGKIYAV PNQKEIALAP MWVFDKELVE KYNIPYENIH SVNDLEPWLK LVKEKEPDFI PFYTQGDSIP LDFDDIVNPL GIFYNDKNLT VTNKFESKEM KDMLLKLREY YEAGYINQDA AVTDMKPEVK RFVWKADGQP YAENIWGKSL GREVVTSSII PPYITNNSTT GAMTAISANS KHPKKAMELL TLVNTDTNLR NLLMFGIEGK HYEKINDKQI KKFDGKKYDV VSWAYGNLLG TYVSENDPID KWEAFEKFNN SAKVSPILGF KFNSENVSNQ ISAINNVLQE FERALYTGSI DPEVGLNDLN KKLNDSGINE VKDEIQKQLN EWKAKNN
|
| |