Gene Information |
Locus tag | CPF_0541 |
Symbol | treB |
ID | 4203920 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 643462 |
End bp | 644883 |
Gene Length | 1422 bp |
Protein Length | 473 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638081423 |
Product | PTS system, trehalose-specific IIBC component |
Protein accession | YP_694995 |
Protein GI | 110800828 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component; [TIGR01992] PTS system, trehalose-specific IIBC component |
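Note: the coordinate and length fields in this record agree with one another if Start bp and End bp are read as 1-based inclusive positions and the CDS is taken to include its stop codon. That reading is an assumption; the record does not state it. A minimal Python sketch of the check follows (the function names are illustrative, not part of any database API):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(643462, 644883)
print(length)                     # 1422
print(protein_length_aa(length))  # 473
```

Both values match the Gene Length (1422 bp) and Protein Length (473 aa) fields.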
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000364471 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGTAAGT TTGAAAAAGA TGCAAAGTTA CTAACAGAGT ACATAGGCGG AAAAGAAAAC GTAGCTGCAG TTACTCACTG TGCTACAAGA ATGAGATTCG TTTTAAATGA TACAAGCAAA GCAGATGTTG AAAAAATAAA AGCTATTCCT TGTGTAAAAG GAACTTTTAC ACAAGCAGGA CAGTTTCAAG TAATAATTGG ACCAGAAGTT GCAACTTTTT ATAATGATTT TGTTGGACAG ACTGGAGTAT CTGGAGAAAG TAAGGAAGAA GTCAAGAAGG CTGCTAAGAA GAATATGAAT ATAGTTCAGA GAGCAGTTGC AGGCTTAGCA GAAATATTTG CTCCGTTAAT ACCAGCTATC ATAGTTGGAG GTCTTATATT AGGATTCCGT AATGTAATCG GAGATATGAA ATTATTTGAA GATGGAACTA AGAGTTTAGT TGAAATTTCT CAATTCTGGG CAGGAACACA TAGTTTCTTA TGGTTAATAG GAGAGGCAAT ATTCCACTTC TTACCTGTAG GAGTTGTTTG GTCAATTGCT AAGAAAATGG GAGCAGATCA AATGCTAGGA ATAGTTATTG GTATAACTTT AGTATCTCCA CAATTATTAA ATGCATATAG TGCAGGTAAT GGAGCCGCTG CTCCAGTTTG GGACTTTGGA TTTGCTCAAG TTCCAATGAT AGGATATCAA GCACAAGTTT TACCAGCAAT CATGGTTGGA TTCACTTTTG TTTACTTAGA AAGATTATTT AAGAAGATTA CACCAGGTCC TATACAAATG ATAATAGTAC CATTTTTCTC AGTAATTCCA ACAGTATTAT TAGCACACAC TGTTTTAGGA CCTATAGGTT GGAAAATAGG TAGCGTAATA TCAGGAGCTA TAGTGGCAGG CTTAACATCA TCATTTGGAT GGTTATTTGC AGGAATTTTT GGAATGATTT ATGCACCATT AGTTATAACA GGATTACATC ATACTTTATT ACCAGTTGAT TTACAACTTA TAAGTGATAT AGGTGGAACA TTCTTATGGC CAATAATTGC ATTATCAAAT ATAGCTCAAG CATCAGCAGT TTTAGCTATG ATATATGTAA ATAGAAAAGA TGAAGATGAA AAACAAATAT CAATACCAGC ATGTATTTCA GGATATTTAG GGGTAACTGA ACCTGCAATG TTTGGGGTTA ACTTAAAATA TCTATATCCA TTCATAGCTT CTATGATTGG AGCAGGAGTT TCTGGTATGT TCTCAATGGC AATGGGATGT ATGGCAAACT CAGTAGGAGT TGGTGGATTA CCAGCAATCT TATCAATGCA ATCATCTAGT ATGTTAATGT ATTTAGTTGC AATGGGAATA GCAATAGTTG TACCATTCAT CTTAACAATA GTGTTCTCAA AGACAAAATT AGCTAATATG GCTTCAAAAT AA
Protein sequence | MSKFEKDAKL LTEYIGGKEN VAAVTHCATR MRFVLNDTSK ADVEKIKAIP CVKGTFTQAG QFQVIIGPEV ATFYNDFVGQ TGVSGESKEE VKKAAKKNMN IVQRAVAGLA EIFAPLIPAI IVGGLILGFR NVIGDMKLFE DGTKSLVEIS QFWAGTHSFL WLIGEAIFHF LPVGVVWSIA KKMGADQMLG IVIGITLVSP QLLNAYSAGN GAAAPVWDFG FAQVPMIGYQ AQVLPAIMVG FTFVYLERLF KKITPGPIQM IIVPFFSVIP TVLLAHTVLG PIGWKIGSVI SGAIVAGLTS SFGWLFAGIF GMIYAPLVIT GLHHTLLPVD LQLISDIGGT FLWPIIALSN IAQASAVLAM IYVNRKDEDE KQISIPACIS GYLGVTEPAM FGVNLKYLYP FIASMIGAGV SGMFSMAMGC MANSVGVGGL PAILSMQSSS MLMYLVAMGI AIVVPFILTI VFSKTKLANM ASK
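Note: both sequences are displayed in 10-character blocks. The sketch below shows one way the gene sequence could be cleaned, summarized, and translated. It assumes the spaces are display-only, and it assumes that translation table 11 assigns amino acids to codons exactly as the standard genetic code does (the two tables differ in which start codons they permit). The helper names are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(seq: str) -> str:
    """Remove the display whitespace between 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = strip_blocks(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_blocks = "ATGAGTAAGT TTGAAAAAGA TGCAAAGTTA"
print(translate(first_blocks))  # MSKFEKDAKL
print(round(gc_fraction(first_blocks), 3))
```

The translated prefix matches the first block of the Protein sequence field. Under the same assumptions, passing the full Gene sequence value to `translate` would be expected to reproduce the listed protein, and `gc_fraction` could be compared against the GC content field.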