Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_0433 |
Symbol | |
ID | 4204753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | + |
Start bp | 520567 |
End bp | 521904 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642564990 |
Product | transport protein ysiA |
Protein accession | YP_697762 |
Protein GI | 110802269 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCCGA GTCCCAGTAT TTTACCAAAA ATTATTCTAA TATTAGTTCT TATCTTAATT AATGCGTTCT TTGCAGCTGC AGAGATGGCG ATGGTATCTG TAAATAAATC TAAGATAAAG ATGCTTGCAG AGAAAGGGAA CAAAAAAGCC CTTTTATTAA AAAAGGTTTT AAAATCACCT GGTAACTTTT TATCTACTAT TCAAATAGGA ATAACATTTG CAGGATTTTT TGCCAGTGCA TCAGCAGCCA CTAGCATTTC AGAAACTCTA GCGCAATTCA TGTACAAGCT AAATATTCCT TATGGTAATG AGATATCAGT TATACTTATA ACTGTGCTTT TGTCTTATAT AACTTTAGTT TTTGGAGAAT TACTTCCAAA GAGAATTGCA TTACAAAAGC CAGAAGAAAT TGCTTTAATG GCTATAAGAC CAATCAATGT TATTTCTAAA ATATCAACAC CATTTGTAAA GATTCTTTCA GCTTCAACAA ACTTATTTAT AAAAATATTA GGGTTAAATA AGTCTGAAGA TAAAGAAACT GTATCTAAGG ATGAAATAAA ATCCATGATA AGTATTGGAC AAGAGAGTGG TGTAATTGAT AAAGCTGAAA AAGATATGTT AGATAATATA TTTGAATTTG ATCATAAAGT TGTTAAAGAA GTTATGACTC CTAGGGGAGA AGTCTTTGCT ATAAAATCAA CAACTCCAAA TGAAACAATT GCTAAGAAAC TTATAAGTGA GCAATTTTCA AGAGTTCCTG TTTATAGTGA AACTAGGGAT AATATAGTAG GAATACTTTA TTTAAAAGAC TTCTTTGAAG CCGTTGTAAA GGTTGGAGTA GATAACATTA AATTAGATCA ATTAATACGT CCAGCTTACT TTGTTATTGA AAATAAAGCT ATAGATGATT TATTTAAAGA ACTTCAAGAT AGTAAGCAAC ATATGGCTGT AATAATAGAT GAATATGGTG GTTTTTCTGG AATTGTTACT ATAGAAGACT TAATTGAAGA AGTTATGGGT GATATATTAG ATGAGTATGA CGATTCAGAA AACTATATAG ATAAAATAGA TAATAATACC TATGTAGTTG ATGGTTTATT AACATTAGAC AAGTTAAATG ATTATTTAAA CCTAAATCTT GAAAGTCAAA ATATAGAGAC TATTGGTGGT TTTGTTGTTA ACTTAATAGG AAATATTCCG CAAAGTGAAA ATCAAATGGT TGAATATGAC AATCTTTCTT TCCAAGTTTG TAAAACAAAT AAGAAGAGAA TTGAAAAGCT AAAAATTTAT TTAAATAATT CAACTAGTTT CAATTCAGAT GTTATATTAA ACAATTAA
|
Protein sequence | MDPSPSILPK IILILVLILI NAFFAAAEMA MVSVNKSKIK MLAEKGNKKA LLLKKVLKSP GNFLSTIQIG ITFAGFFASA SAATSISETL AQFMYKLNIP YGNEISVILI TVLLSYITLV FGELLPKRIA LQKPEEIALM AIRPINVISK ISTPFVKILS ASTNLFIKIL GLNKSEDKET VSKDEIKSMI SIGQESGVID KAEKDMLDNI FEFDHKVVKE VMTPRGEVFA IKSTTPNETI AKKLISEQFS RVPVYSETRD NIVGILYLKD FFEAVVKVGV DNIKLDQLIR PAYFVIENKA IDDLFKELQD SKQHMAVIID EYGGFSGIVT IEDLIEEVMG DILDEYDDSE NYIDKIDNNT YVVDGLLTLD KLNDYLNLNL ESQNIETIGG FVVNLIGNIP QSENQMVEYD NLSFQVCKTN KKRIEKLKIY LNNSTSFNSD VILNN
|
| |