Gene CPF_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2026 
Symbol 
ID4202756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2266486 
End bp2267487 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content30% 
IMG OID638082895 
Producthypothetical protein 
Protein accessionYP_696459 
Protein GI110800232 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTGA TATCCATATT CATAATTTTA CTTGTGATTA ACTTAGCTGT ATTTGTAGTT 
AAATATAACT CCATAAAGCG AAGCCCTTTA CAATCTAATA AAGCTGATAT AACCTTTAAA
GTTAAAGAAG GAGAGTCACT AAATGGACTT TTTGAACGAT TAAACAATGA AAATGTACTT
AGAAGTTCCT TTTTTTCTAA GATATATATA AAGTTTAATA ACGTAGAAGA AAGTATAAAA
CCAGGTACAT ATACTGTGAA TAGTGACATA AGTTTTAATG ATTTTTTAAG TGTACTAACT
GATGGGAAAG TATCAGATTA TAAGGTAACA TTTCCAGAGG GATACACTGT AGAGGATATA
GCTAAGAAAT TGGAAGAATC TAAGGTATGC ACAAAAGATG AGTTTTTAAA AGTAGTAAAA
GAGTATCCAT TACCATCTTA TATAAAACCT AATAATGAAA GAAAATATGA GTTAGAAGGA
TTTTTATTTC CAGACACATA TGCTATTCCT AAAGGCACAA CACCAAAACA AATAATTGAA
ATGATGCTTA ACAGGTTTGA AGGGGTAATT AGTGAGATAC AAAGTGAACT AGGCATTACT
ATTCCAAAGG AAGAGTATGA GAAATATGTA ATAGTAGCTT CAATGGTTGA AAAAGAGGCT
AGGGATGATA GTGAGCGTGC AGAAATAGCA TCTGTTATAT ATAACAGACT ACAAAAAGGT
ATGCCTTTAC AAATCGATGC TACAGTTTTA TATGCTTTAG GAGAGCATAA AGATACTGTG
CTTTATAAAG ACTTAAAAGT GGATTCACCA TATAATACAT ATAAGATTAA AGGACTTCCA
GTGGGGCCAA TATGTAATCC TGGAAAACCT TCACTTTTAG CTGCCATAAA ACCAGCTAAA
ACAGACTACA TATATTATTT ATTGAATCCA TCAAATAATA AGCACTATTT TACTAATAAT
TACGAAGATT TCCTAGCTAA GAAGAAAGAA TTTGGATACT AA
 
Protein sequence
MVLISIFIIL LVINLAVFVV KYNSIKRSPL QSNKADITFK VKEGESLNGL FERLNNENVL 
RSSFFSKIYI KFNNVEESIK PGTYTVNSDI SFNDFLSVLT DGKVSDYKVT FPEGYTVEDI
AKKLEESKVC TKDEFLKVVK EYPLPSYIKP NNERKYELEG FLFPDTYAIP KGTTPKQIIE
MMLNRFEGVI SEIQSELGIT IPKEEYEKYV IVASMVEKEA RDDSERAEIA SVIYNRLQKG
MPLQIDATVL YALGEHKDTV LYKDLKVDSP YNTYKIKGLP VGPICNPGKP SLLAAIKPAK
TDYIYYLLNP SNNKHYFTNN YEDFLAKKKE FGY