Gene CPF_2102 details


Gene Information

Locus tag: CPF_2102
Symbol:
ID: 4201194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2334314
End bp: 2335441
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 33%
IMG OID: 638082967
Product: putative methyltransferase
Protein accession: YP_696531
Protein GI: 110799710
COG category: [L] Replication, recombination and repair
COG ID: [COG0116] Predicted N6-adenine-specific DNA methylase
TIGRFAM ID:
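
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates (the record does not state its convention) and a CDS whose final codon is a stop codon. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2334314, 2335441)
print(length_bp)                  # 1128
print(protein_length(length_bp))  # 375
```

Under these assumptions the results agree with the listed Gene Length (1128 bp) and Protein Length (375 aa).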


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.486637
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTATA ATTTAATAGC CACTGCAACT TTTGGCTTAG AGGCTGTTGT TGCAAAGGAA 
TTAAAAGAGT TAGGATATGA AGACCTAAAA ACTGAAAACG GAAGAGTTCA TTTTGAAGGG
GATGAAATGG ATATTGCCAT AACAAACCTT TGGCTTAGAA CTGCAGATAG AGTTTTAATA
AAAGTGGCTG AATTTAAAGC TGAAAGCTTT GAAGAGTTAT TTAATAAAAC TGTAGAGATT
GATTGGAGTA AGTATATACC TGTAGATGGT AAGATGCATG TTGTTGGTAA ATCTGTTAAG
TCAAAACTTT TTAGTGTTCC AGACTGCCAG TCAATAGTAA AAAAAGCCGT AGTTAAGAGT
ATGAGCAGAA GTTATGGTCA AGATTGGTTC ACAGAAGATG GTCCAGTTTA TAAAATAGAA
GTTGGACTTT TAAAAGATGT GGTTACCTTA ACAATAGATA CTTCAGGAGA GGGATTACAC
AAAAGAGGAT ATAGAGAACA CTCAGGGCAA GCTCCCCTTA AGGAAACACT AGCTGCAGCT
ATGGTTTTAC TTTCAAAGTG GAGAGGAGAG CAAACTCTTA TAGACCCATG TTGTGGATCA
GGAACAATAT TAATAGAAGC TGCTATGATA GCTAAAAACA TAGCTCCAGG ATTACATAGA
AAATTTGTTT CTGAAACTTG GCCTTCAATG GATAAGGAAA TTTGGGATCA AGTTAGAGAG
GGAGCAGAGA AATCTATAAA GAAAATTCCT TTAGATATAA CTGGTTATGA TATAGATAGT
TGGGTATTAA GTACAGCTAA AAATAACGTA AGAAAAGCAG GATTAACTGA TTGTATAACT
ATAGAAAAAA GAAACTTTTT TGATTTTTCA ACTAAGAAAA AGTATGGATA TATGATTACA
AATCCACCAT ATGGTGAGAG AATAGGTGAA AAAGAAATAG TTTCAAAATT AAATAAACAC
TTTGGAGAAG TTAAAGAGAA GTTAGATACT TGGGATTTTA ATATTCTAAC AGCATGTCCA
GATTTCCAAA AAGAGTTTGG TAGAAAAGCT ACTAAAAACA GAAAGCTTTA TAATGGTAGA
CTTTTATGCT ACTACTATCA ATATTTAGAT AATAATTTAA AGAAGTAA
 
Protein sequence
MDYNLIATAT FGLEAVVAKE LKELGYEDLK TENGRVHFEG DEMDIAITNL WLRTADRVLI 
KVAEFKAESF EELFNKTVEI DWSKYIPVDG KMHVVGKSVK SKLFSVPDCQ SIVKKAVVKS
MSRSYGQDWF TEDGPVYKIE VGLLKDVVTL TIDTSGEGLH KRGYREHSGQ APLKETLAAA
MVLLSKWRGE QTLIDPCCGS GTILIEAAMI AKNIAPGLHR KFVSETWPSM DKEIWDQVRE
GAEKSIKKIP LDITGYDIDS WVLSTAKNNV RKAGLTDCIT IEKRNFFDFS TKKKYGYMIT
NPPYGERIGE KEIVSKLNKH FGEVKEKLDT WDFNILTACP DFQKEFGRKA TKNRKLYNGR
LLCYYYQYLD NNLKK
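
Both sequences are printed in space-separated blocks of 10 characters, so they need cleaning before analysis. The sketch below strips the whitespace, computes GC content, and translates DNA up to the first stop codon. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_block.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
dna = clean("ATGGATTATA ATTTAATAGC CACTGCAACT")
print(len(dna))        # 30
print(translate(dna))  # MDYNLIATAT
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence through `clean` and `gc_content` is one way to compare against the GC content field of the record.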