Gene CPF_1759 details


Gene Information

Locus tag: CPF_1759
Symbol:
ID: 4201223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 1979260
End bp: 1980975
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 31%
IMG OID: 638082631
Product: BNR/Asp-box repeat-containing protein
Protein accession: YP_696195
Protein GI: 110799218
COG category:
COG ID:
TIGRFAM ID:
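
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the Start bp and End bp fields of this record.
start_bp = 1979260
end_bp = 1980975

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed in the record.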


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00340531
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGGTA AAAAAATATG TAAATCCTTA GAGGGAGATG GAGTTTTAAG TTATTCTGTA 
AAAGAAAGAT TTTTTAGTGA TTTAAAAGAA TTTGTAGATA TAAGTGAAGA TATTAATAAG
ATTAAAAATC TAAAGGAATT TACTATTGTT ATAAAATTTA GAAGCAATAT AAATAGTGGG
GATAAAACTT TATTTAGCAT ATTAAATTCA ACAAAAAACT CTAGTGAATT GGTTTTAAGA
TTAAGTGATG GAAAGTTAAA TCTTTATATA AGGGAAAATC ATAATTTACT TTGTCATATA
AAATCAGCAA AAAAATATGG TGATAATTCT TGGCACATAG TAATTATGTC CTTAGGAGAT
TGGGGAATAA AACTATATGT TGATGGTAAT GAAGTAGGGT ATTTAAAGAG TCCTATTAAT
CTTAGTATGA TTACAGAACT TAATTCTATG AATATAGGAA GAGCTTTAGA TAATAAAGGA
GAAGGTATAA GGCATTTTCA TGGGGATATA GATTACTTAG ATTTATATGA CAGATGCTTA
AGTAGAGAAG AAGTAAAGGA ATTAAGTAAA CAAGAAGTAA AAATTGGATA TGATATTCCT
TTTATAGATT TATCAAAGGA TAAGGATAGA CAGGTTTTAG TTGATAAAGA AGAAGGAGTT
TACTTAGGAC ATCCATCAAC AGTTCTAATG GATGATAAGA AAACCATGTA TGTTGTTTAT
CCAAAGGGAC ATGGTGTAGG ACCAATAGTT CTTAAGAAAA GCGAGGATTC AGGATTAACT
TGGAGTGAAA GATTAGAAAC ACCAGTAAGC TGGAATAATA GTGAGGAAAC TCCTATTATA
TATAAAATAA AGAAGCCAAA TGGTATAAGT AGAATTGAAA TGATATCTGG AATGCCAAGA
GGTGGAGAAA AGGGATTTAG AACCTCATAT TCAGATGACT GTGGAAAAAC ATGGAGTGAA
TTTAAACATT ATTTTCCTAC AGGTAAATAT GGAGGAATAG TTGCTCACGC TAGTTTAACT
AGGCTTAAGA ATAAAAAGGG AGATATGGAT AATAAGTGGC TTGGAATTTT TCATGATCTT
AATTATAATA ACTGGAAAAC TTATTTAAGC TTTGATGAAA GTGGAGAAGA GGTTTGGACA
GAGCCAGTTA GGCTTTTAGA AGAGCATAAC TTAATAGAAA AGACTGCACA GTTATGTGAA
ATTGAAGTAT TACGTTCTCC TGATGGAAAT CAATTAGCTT TAATAGCTAG ATCACAAGGT
AAGAAAAACA ATTCTATGAT TGCATTTTCC AATGATGAAG GTGAAACTTG GACTGAGCCT
TTAGAACTTC AAGGAGCTTT AATGGGAGAA AGACATAAAG CAACTTATGA TCCTATATCA
GGAAGACTAC TAATAACTTT TAGAGAAATA ATAAGAGATT CAAAGAAAAC AGGAGATAAA
AATGACTGGG TAGCTGGTCA CTGGGTTGCT TGGGTAGGAA CCTATGATGA TTTAGTTCAT
AATAGAGAAG GGCAATATAG AATAAGACTT ATGGAAGATT TTACTCCTAC TGAAAAATCA
GGAGATTGTG GATATGCAGG AAACGAAGTT TTAGATGATG GAACTTTTGT ATTAACTTCC
TATGGCTATT GGGAAAAAGA TTATAACAAG CCATATATAA AAAGCTTAAG AGTAACTTTA
AAGGAAATTG ATGAAATAGT TAGAGAGATG GTATAG
 
Protein sequence
MRGKKICKSL EGDGVLSYSV KERFFSDLKE FVDISEDINK IKNLKEFTIV IKFRSNINSG 
DKTLFSILNS TKNSSELVLR LSDGKLNLYI RENHNLLCHI KSAKKYGDNS WHIVIMSLGD
WGIKLYVDGN EVGYLKSPIN LSMITELNSM NIGRALDNKG EGIRHFHGDI DYLDLYDRCL
SREEVKELSK QEVKIGYDIP FIDLSKDKDR QVLVDKEEGV YLGHPSTVLM DDKKTMYVVY
PKGHGVGPIV LKKSEDSGLT WSERLETPVS WNNSEETPII YKIKKPNGIS RIEMISGMPR
GGEKGFRTSY SDDCGKTWSE FKHYFPTGKY GGIVAHASLT RLKNKKGDMD NKWLGIFHDL
NYNNWKTYLS FDESGEEVWT EPVRLLEEHN LIEKTAQLCE IEVLRSPDGN QLALIARSQG
KKNNSMIAFS NDEGETWTEP LELQGALMGE RHKATYDPIS GRLLITFREI IRDSKKTGDK
NDWVAGHWVA WVGTYDDLVH NREGQYRIRL MEDFTPTEKS GDCGYAGNEV LDDGTFVLTS
YGYWEKDYNK PYIKSLRVTL KEIDEIVREM V
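
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a listing might be processed, the code below strips the spacing, computes the GC fraction, and translates codons with a hand-built codon table. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The table holds the standard amino acid assignments, which translation table 11 shares; table 11 differs in its permitted start codons, which this sketch does not model. Only the first 60 nucleotides of the gene sequence are used as input.

```python
# Build a codon table in the conventional TCAG order.
# These are the standard amino acid assignments, which table 11 also uses.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    return sum(1 for base in seq if base in "GC") / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons appear as '*'."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence listed in this record.
first_line = "ATGAGGGGTA AAAAAATATG TAAATCCTTA GAGGGAGATG GAGTTTTAAG TTATTCTGTA"
dna = clean(first_line)

print(len(dna))
print(round(gc_fraction(dna), 3))
print(translate(dna))
```

Translating these 60 nucleotides gives the first 20 residues of the protein sequence listed in the record. Applying the same functions to the full gene sequence is a matter of passing in all of its lines joined together.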