Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1041 |
Symbol | |
ID | 4205306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1186028 |
End bp | 1187047 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642565598 |
Product | hypothetical protein |
Protein accession | YP_698364 |
Protein GI | 110802119 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2706] 3-carboxymuconate cyclase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00230518 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAT CTATTGCATA TATAGGTACT TACACAAATG GGGTCAGTAA AGGTATCTAT AGATTATGTC TAGATACTGC TAAAGGAAAT ATTGAAGATT TATCCGTAGT AGCTAGATTC GGTAATCCTA CTTATTTGTG CATACATAAC AATAAACTAT ATACAGTAGG TAACCCAATT TCACTAGATA CACTAGGTGG TGTTGCTTCT TATAATATAG AAGAAGATTA TTCATTGAAA TTAACAGGGG CTAGTTTACT TCAAGGTAAA AAACCTTGTC ATATTAATAT AATCCCAGAT AAATCCTTAA TAGTTTCTAG TAATTTTCAC GAAAAATCAA TTAATACATA TTCATTAAAT GAGAATTTTG ATATAGATAC TTCATTAAGT GCATTTTCTC ATAAAGATGA CTCAAAAATG CATTTTGCAT CAACAACTCC AGATAACAAA TTTATATGTG CTGTAAATTT AGGTATGGAT AGAATAGAAC TTTTTAAAAT CAATTCTAAT AACACCTTAA GTTACATTGA AAATCTAAGT TTTTATTGTA CTAAAGGATG CGGTCCAAGA CATATAGAAT TTTCAAAGAA TGGAAAGTTT GCATATGTTA TATGTGAAAA TAGTTCTGAA ATAATTATAT TAAAATATTT AGGTGAAGAA GGATTTAAAT TAGTTCAATA TCTTCATGTA CTTCCTAATG GCTTTGGAGG ACAAAATTTT GGTTCTGCAA TAAAAATAAG TCCTTGTAAT AAATTCCTAT ACGTTTCTAA CAGAGGCTTT AATGGAATAT CAGCCTTTAG AATAAATGAG GAAACTGGTT CTTTATCACT TATAAATCAC TATAGTTCAC ATGGTGATTT CCCTAGGGAT TTTGAAATCA GTCCATGTAA TAAGTTCTTA GTAATTGCAA ATGAAAAATC AGATAACCTA ACAATATATT TAAAAAATCC AGATGGAACA CTAAAACTTT TTAAAGATGA TATATTTATT CCATCTCCTA CATGTATAAA ATTTAAATAG
|
Protein sequence | MNKSIAYIGT YTNGVSKGIY RLCLDTAKGN IEDLSVVARF GNPTYLCIHN NKLYTVGNPI SLDTLGGVAS YNIEEDYSLK LTGASLLQGK KPCHINIIPD KSLIVSSNFH EKSINTYSLN ENFDIDTSLS AFSHKDDSKM HFASTTPDNK FICAVNLGMD RIELFKINSN NTLSYIENLS FYCTKGCGPR HIEFSKNGKF AYVICENSSE IIILKYLGEE GFKLVQYLHV LPNGFGGQNF GSAIKISPCN KFLYVSNRGF NGISAFRINE ETGSLSLINH YSSHGDFPRD FEISPCNKFL VIANEKSDNL TIYLKNPDGT LKLFKDDIFI PSPTCIKFK
|
| |