Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPR_1214 |
Symbol | |
ID | 4206372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens SM101 |
Kingdom | Bacteria |
Replicon accession | NC_008262 |
Strand | - |
Start bp | 1360949 |
End bp | 1362214 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 642565770 |
Product | hypothetical protein |
Protein accession | YP_698536 |
Protein GI | 110803501 |
COG category | [S] Function unknown |
COG ID | [COG3581] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.113079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATAG AAAATAATAA AAAATTTGCA GTATTTACTA AAGAGATGAG AAAGACTCAT ACTATTTTAG CTCCTACAAT GTTGCCAATA CACTTTAAAC TTTTATCTGG AATAATGAAA AAGTACGGTT ATAAATTAGA ATTTTTTGAT GGTGATACAA ATAGTGCCAT TGAAGAAGGA TTAAAAAGTG TTCATAACGA TATATGCTAT CCAGCTATGA TTGTAATTGG ACAACTCATG GAAGCTTTAA AAAGTGGAAA ATATGACCCA GACAAAACGG CTTTAATAAT GAGCCAAACT GGGGGAGGAT GTCGTGCATC TAACTATATA CATTTATTAA GAAAGGCACT AAAAGAAAAT AACTTAGAGC AAGTTCCAGT AATTTCTTTA AATGCAAGTG GATTAGAGAA GCATGAAGGG TTTAAAATGT ATCCAGGCTT ATTAATAAAA TGCATATATG CTCTTTATTA TGGAGATCTG CTTATGTATA TGTATAATCA GTGTAAATCT TATGAGATTA ATAAGGGTGA AACAGATAGA GTTTTAGATA AGTGGATTGA GATTTTAAAA GAGAAGTTTG AAACCTTAAA ATATTTAAAG GTAAGAGAAC TATATAAAAA TATAATTTTA GATTTTTCTA AGATAGAACT AGAAGAAACT GAAAAAGTAA AGGTTGGTAT AGTGGGAGAA ATTTATTTAA AGTATTCTCC TTTAGGAAAT AATAATTTAG AGGAATTTTT AAGAGAAGAT AATGCTGAAG TTGTTATGTC TGGTGTTACA GATTTCTTTA TGTATTGTCT ATCAAATTCA GAAATAGATT ATAAATTATA TGGAATGAAA AAAGTTTCAA GTAAATTTAC TAAAATTGGA TGTTCATATC TAGAGCATTT ACAAAATATT ATGATAGATT CAATAAAAAA ATATAGTAAG TTTAGAGCAC CTGCTCCATT TAAGGAATTA AAAACTTCTG TTGAAGAGTA CATAGGAAGA GGAGTTAAAA TGGGTGAAGG ATGGCTTATG ACTGCTGAAA TGTTAGAACT TATAAATAGT GGGGTAAATA ATATAGTTTG TGCTCAACCC TTTGGATGTC TTCCTAATCA TATAGTTGGA AAAGGTATGA TTAGAGGAAT TATGGAGAAA CATCCAGAAG CTAATATAGT TGTTGTAGAT TATGATCCCT CTTCAACAAA GGTAAATCAA GAAAATAGAA TAAAATTAAT GTTAGCTAAT GCTAAATTAA GTGAATATAT ATTAAATAAC AACTAA
|
Protein sequence | MEIENNKKFA VFTKEMRKTH TILAPTMLPI HFKLLSGIMK KYGYKLEFFD GDTNSAIEEG LKSVHNDICY PAMIVIGQLM EALKSGKYDP DKTALIMSQT GGGCRASNYI HLLRKALKEN NLEQVPVISL NASGLEKHEG FKMYPGLLIK CIYALYYGDL LMYMYNQCKS YEINKGETDR VLDKWIEILK EKFETLKYLK VRELYKNIIL DFSKIELEET EKVKVGIVGE IYLKYSPLGN NNLEEFLRED NAEVVMSGVT DFFMYCLSNS EIDYKLYGMK KVSSKFTKIG CSYLEHLQNI MIDSIKKYSK FRAPAPFKEL KTSVEEYIGR GVKMGEGWLM TAEMLELINS GVNNIVCAQP FGCLPNHIVG KGMIRGIMEK HPEANIVVVD YDPSSTKVNQ ENRIKLMLAN AKLSEYILNN N
|
| |