Gene CPF_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1101 
Symbol 
ID4203593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1256667 
End bp1258130 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content34% 
IMG OID638081982 
Productsulfatase family protein 
Protein accessionYP_695547 
Protein GI110800911 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00123552 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATAT TAATGTTAGA TTTAGACACT TTAAGAGCAG ATCATTTAGG TTGCTATGGA 
TATGAAAGAA ATACATCTCC TAATATAGAT AGTGTTTCAA GGGAGGGTAT AACTTTTGAT
AATTATTATT GTTCAGATGC ACCATGTCTT CCTTCAAGGG CTGCACTTAT GTCAGGAAGA
TTTGGAATAC ATACTGGTGT TGTAAATCAT GGAGGAGTTT GTGCTGATTT CAGAATAGAT
GGGGATAATA GAGGTTTCAA TGATAGAATG GCATTTAATA GCCTTCCTAT GTTTTTAAGA
AGTGAAGGGT TTTATACAGC TTCTATAAGT ACTTTTGCTG AGAGACATAG TGCTTGGTGG
TTTAATGCTG GGTTTAATGA GCTTCATAAC GTAGGTGGAT GTGGAGCAGA GTCAGCAGAA
GAAGTAACTC CAACTGTCTT AAAATGGATA GAAGATAATT GCGATAAAGA TAACTGGTTC
CTTCATGTAA ACTATTGGGA TGCACATACT CCTTATAGAA CTCCAGATAC TTATGAAAAT
CCCTTTGAAG GAGATGGTTT AAAGAGATGG ATAAGTGAGG AAAGATTTAA TGAACATCGT
AACAATAAAA TTGGACCACA TGGAGCTAGA GAAATAGGAA TGTATAATAG TGATACATCA
CCAAGATTTC CAAAACATAT GGGAGAAATT AAAAATTATG ATGATCTTAT AAAATTCTTT
GATCAATATG ATTCAGGAAT AAATTATATG GATTCTCATA TTGGACAAAT ACTCAATTTA
TTAAAAGATA AGGGATTATA CGAAGACCTT GCTATTATAA TAACATCAGA TCATGGGGAG
GCAATAGGTG AATTTGGTAT GTATGCAGAG CACGGAACTG CTGATTATGC AACTACTAAA
ATACCAATGA TAATAAAGTG GCCTGGAGCT ATGAAAAATT ACAGAGATGA TGGATTCCAT
TATAATTTAG ATTTAGTACC AACTTTAGCT GAGTTATTTA ATAAGGAAAA GAAAGAGTAC
TGGGATGGTA GAAGCTATGC ACAAAGTATT TTAAATGGAG AAGATACAGG AAGAGATTAT
TTGGTTTTAG GGCAATGTGC TCATGTGTGC CAAAGAGCTG TAAGATTTAA AGATTATATT
TATATTAGAA CTTATCATGA TGGATACCAC TTATTTCCAA AGGAAATGCT TTTTAATGTA
GAGGATAATC CTCACGAAAT AAGAAATTTA GCAGAGATAA GAAAAGAACT ATGCATGGAA
GGAGCATATC TTCTTCAACA GTGGCATGAT GAAATGATGA TGAGTAGTGA AAGTGATGTT
GATCCCCTTT GGACTGTAAT AAGAGAGGGA GGACCATATC ATGCTAAAGG TCATTTAAAA
GAGTACTGTA AAAGATTAGA GCAAACAGGA AGAGGATGGG CAGTTCCAGA ACTTAAGAGA
AGACATCCAG AGGAATTTAA ATAG
 
Protein sequence
MRILMLDLDT LRADHLGCYG YERNTSPNID SVSREGITFD NYYCSDAPCL PSRAALMSGR 
FGIHTGVVNH GGVCADFRID GDNRGFNDRM AFNSLPMFLR SEGFYTASIS TFAERHSAWW
FNAGFNELHN VGGCGAESAE EVTPTVLKWI EDNCDKDNWF LHVNYWDAHT PYRTPDTYEN
PFEGDGLKRW ISEERFNEHR NNKIGPHGAR EIGMYNSDTS PRFPKHMGEI KNYDDLIKFF
DQYDSGINYM DSHIGQILNL LKDKGLYEDL AIIITSDHGE AIGEFGMYAE HGTADYATTK
IPMIIKWPGA MKNYRDDGFH YNLDLVPTLA ELFNKEKKEY WDGRSYAQSI LNGEDTGRDY
LVLGQCAHVC QRAVRFKDYI YIRTYHDGYH LFPKEMLFNV EDNPHEIRNL AEIRKELCME
GAYLLQQWHD EMMMSSESDV DPLWTVIREG GPYHAKGHLK EYCKRLEQTG RGWAVPELKR
RHPEEFK