Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CPF_1101 |
Symbol | |
ID | 4203593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium perfringens ATCC 13124 |
Kingdom | Bacteria |
Replicon accession | NC_008261 |
Strand | + |
Start bp | 1256667 |
End bp | 1258130 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638081982 |
Product | sulfatase family protein |
Protein accession | YP_695547 |
Protein GI | 110800911 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00123552 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAATAT TAATGTTAGA TTTAGACACT TTAAGAGCAG ATCATTTAGG TTGCTATGGA TATGAAAGAA ATACATCTCC TAATATAGAT AGTGTTTCAA GGGAGGGTAT AACTTTTGAT AATTATTATT GTTCAGATGC ACCATGTCTT CCTTCAAGGG CTGCACTTAT GTCAGGAAGA TTTGGAATAC ATACTGGTGT TGTAAATCAT GGAGGAGTTT GTGCTGATTT CAGAATAGAT GGGGATAATA GAGGTTTCAA TGATAGAATG GCATTTAATA GCCTTCCTAT GTTTTTAAGA AGTGAAGGGT TTTATACAGC TTCTATAAGT ACTTTTGCTG AGAGACATAG TGCTTGGTGG TTTAATGCTG GGTTTAATGA GCTTCATAAC GTAGGTGGAT GTGGAGCAGA GTCAGCAGAA GAAGTAACTC CAACTGTCTT AAAATGGATA GAAGATAATT GCGATAAAGA TAACTGGTTC CTTCATGTAA ACTATTGGGA TGCACATACT CCTTATAGAA CTCCAGATAC TTATGAAAAT CCCTTTGAAG GAGATGGTTT AAAGAGATGG ATAAGTGAGG AAAGATTTAA TGAACATCGT AACAATAAAA TTGGACCACA TGGAGCTAGA GAAATAGGAA TGTATAATAG TGATACATCA CCAAGATTTC CAAAACATAT GGGAGAAATT AAAAATTATG ATGATCTTAT AAAATTCTTT GATCAATATG ATTCAGGAAT AAATTATATG GATTCTCATA TTGGACAAAT ACTCAATTTA TTAAAAGATA AGGGATTATA CGAAGACCTT GCTATTATAA TAACATCAGA TCATGGGGAG GCAATAGGTG AATTTGGTAT GTATGCAGAG CACGGAACTG CTGATTATGC AACTACTAAA ATACCAATGA TAATAAAGTG GCCTGGAGCT ATGAAAAATT ACAGAGATGA TGGATTCCAT TATAATTTAG ATTTAGTACC AACTTTAGCT GAGTTATTTA ATAAGGAAAA GAAAGAGTAC TGGGATGGTA GAAGCTATGC ACAAAGTATT TTAAATGGAG AAGATACAGG AAGAGATTAT TTGGTTTTAG GGCAATGTGC TCATGTGTGC CAAAGAGCTG TAAGATTTAA AGATTATATT TATATTAGAA CTTATCATGA TGGATACCAC TTATTTCCAA AGGAAATGCT TTTTAATGTA GAGGATAATC CTCACGAAAT AAGAAATTTA GCAGAGATAA GAAAAGAACT ATGCATGGAA GGAGCATATC TTCTTCAACA GTGGCATGAT GAAATGATGA TGAGTAGTGA AAGTGATGTT GATCCCCTTT GGACTGTAAT AAGAGAGGGA GGACCATATC ATGCTAAAGG TCATTTAAAA GAGTACTGTA AAAGATTAGA GCAAACAGGA AGAGGATGGG CAGTTCCAGA ACTTAAGAGA AGACATCCAG AGGAATTTAA ATAG
|
Protein sequence | MRILMLDLDT LRADHLGCYG YERNTSPNID SVSREGITFD NYYCSDAPCL PSRAALMSGR FGIHTGVVNH GGVCADFRID GDNRGFNDRM AFNSLPMFLR SEGFYTASIS TFAERHSAWW FNAGFNELHN VGGCGAESAE EVTPTVLKWI EDNCDKDNWF LHVNYWDAHT PYRTPDTYEN PFEGDGLKRW ISEERFNEHR NNKIGPHGAR EIGMYNSDTS PRFPKHMGEI KNYDDLIKFF DQYDSGINYM DSHIGQILNL LKDKGLYEDL AIIITSDHGE AIGEFGMYAE HGTADYATTK IPMIIKWPGA MKNYRDDGFH YNLDLVPTLA ELFNKEKKEY WDGRSYAQSI LNGEDTGRDY LVLGQCAHVC QRAVRFKDYI YIRTYHDGYH LFPKEMLFNV EDNPHEIRNL AEIRKELCME GAYLLQQWHD EMMMSSESDV DPLWTVIREG GPYHAKGHLK EYCKRLEQTG RGWAVPELKR RHPEEFK
|
| |