Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1785 |
Symbol | |
ID | 4202823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | - |
Start bp | 2012076 |
End bp | 2013428 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638082657 |
Product | PTS system, sucrose-specific IIBC component |
Protein accession | YP_696221 |
Protein GI | 110799103 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific |
TIGRFAM ID | [TIGR00826] PTS system, glucose-like IIB component [TIGR01996] PTS system, sucrose-specific IIBC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000306537 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAG AACAAATAGT TGCTCAAGAA ATATTAAAAA ATATTGGTGG AAAAGAAAAT ATAAAATCAA TGGAACACTG TGCAACTAGA TTAAGACTTA TAGTTAAAGA TAAAAGTCTA ATAAATGAAA AGGCAATTGA GAATATTGAT GGGGTAAGAG GACAATTTTT TGCTGCTGCT CAATATCAAA TAATTCTAGG AACTGGGTTT GTAAATAAAG TCTTTGCAGC TATGAATGGT GAAGGAGTAG AAACTGGGAA TGTAAAAGAA GATGCATATA GTGATATGAC TTTACCACAA AAAATATCTC GTACTTTAGG AGATATTTTC GTTCCTATAA TTCCAGTATT AGTTGCAACA GGTTTATTTA TGGGATTAAG AGGACTTTTA ACTAATTTAG GAGTTGAATT TAGTCCAACT TTTAATACTT TATCAGAGGT TTTAACAGAT ACAGCATTTA TATTCTTACC AGCCTTAGTA GCTTGGTCAA CAATGAAAAA ATTTGGTGGA ACACCTGTAG TTGGTATAGT ATTAGGGCTT ATGCTTGTAG CCCCTCAGCT TCCAAATGCT TGGCAGGTAG CAGGAGGAGC AGATCCAATA TATATTTCTT TATTAGGAAT AAGTATCCCT ATAGTAGGGT ATCAAGGATC AGTACTTCCA GCTTTAGTAT TAGGTATTAT AGCAGCTAAA TTAGAAAAAT TTATTAGAAA GTTTATGCCA GATGTACTAG ATTTAATATT TACTCCATTT TTAACTTTAT TAGTATCAAT GATTCTTGGA CTTTTAGTAG TAGGTCCTAT AATGCATACA AGTGAAGTCT ACATATTAGA TTTATTTAAA ATGTTTTTAA GCTTACCATT TGGAATAGGT GGAGCAATAA TTGGAGGAGT TCATCAAGTT ATAGTTGTAA CTGGAGTTCA TCATATATTT AATGCTTTAG AAGTTGAATT AATTTCAAGT ACAGGATTAA ATCCATTTAA TGCAGTAATA ACCGGGGCTA TAGTAGCTCA AGGAGCAGCA GCTCTTGCAG TTGGATTTAA GACAAAAGAT AAGAAAAAAC GTTCATTATA TATTTCTTCA GCAATACCAG CCTTTTTAGG AATTACAGAA GCGGCCATAT TTGGAGTAAA CTTAAGATTT ATTAAACCAT TTATATTTGC ATGTATAGGT GGAGCTGCAT CAGGAATGTT TGCATCAATG ATGAAATTGG CTGGAACTGG TATGGGAATA ACAGCTATAC CGGGAACACT GCTTTATATT AATACAGGAT TAATTCAGTA CTTCATAACA ATTGCTATTG GATTTGCTAT TTCATTTGCA TTAACATATA TATTCTTTAA ACCACAAGAA TAA
|
Protein sequence | MSKEQIVAQE ILKNIGGKEN IKSMEHCATR LRLIVKDKSL INEKAIENID GVRGQFFAAA QYQIILGTGF VNKVFAAMNG EGVETGNVKE DAYSDMTLPQ KISRTLGDIF VPIIPVLVAT GLFMGLRGLL TNLGVEFSPT FNTLSEVLTD TAFIFLPALV AWSTMKKFGG TPVVGIVLGL MLVAPQLPNA WQVAGGADPI YISLLGISIP IVGYQGSVLP ALVLGIIAAK LEKFIRKFMP DVLDLIFTPF LTLLVSMILG LLVVGPIMHT SEVYILDLFK MFLSLPFGIG GAIIGGVHQV IVVTGVHHIF NALEVELISS TGLNPFNAVI TGAIVAQGAA ALAVGFKTKD KKKRSLYISS AIPAFLGITE AAIFGVNLRF IKPFIFACIG GAASGMFASM MKLAGTGMGI TAIPGTLLYI NTGLIQYFIT IAIGFAISFA LTYIFFKPQE
|
| |