Gene CPR_C0014 details


Gene Information

Locus tag: CPR_C0014
Symbol:
ID: 4206661
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008265
Strand:
Start bp: 14684
End bp: 15913
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 27%
IMG OID:
Product: phage portal protein, HK97 family
Protein accession: YP_699943
Protein GI: 110804051
COG category:
COG ID:
TIGRFAM ID:
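
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 14684
end_bp = 15913

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the terminal stop codon,
# which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1230, matching "Gene Length"
print(gene_length % 3)  # 0, the CDS is a whole number of codons
print(protein_length)   # 409, matching "Protein Length"
```

Under these assumptions the three reported numbers (coordinates, 1230 bp, 409 aa) are mutually consistent.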


Plasmid Coverage information

Num covering plasmid clones: 77
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCTTTTCA GAAAAGGCTT TAAAAACCAA AGTCAAGAAA TATCTATAGA TGATAAAAAA 
ATACTTGAAT GGTTAGGAAT AAATCCAAGT GAAACATATG TTAATGGTAA GAGTTGTTTA
AAACAAGCAA CAGTTTTTGG GTGTATAAGA ATTTTAAGTG ATAATATAAG CAAATTACCT
ATAAAAATTT ATCAAAAAAA GGATGGAATA AAAAGAGTTC CAGATCATTA TTTAGAATAT
TTATTGAAAT TAAGACCCAA TCCTTATATG AGTTCTAGTG ATTTTTGGAA GTGTATTGAA
GTTCAAAGAA ATATTTATGG AAATGCATAT GTTGCTTTAG ATTTTAAGAA AAATGGTGAA
ATAAAGGGAT TATATCCTTT GAAATCCGAT GGAATGAAAA TATTTGTTGA TGATACTGGC
CTTTTAAATT CAGAAAACAA TGTTTGGTAT TTATATACTG ATGATTTAGG CCAAAGGCAT
AAGTTTATGA GTGATGAAAT TTTACATTTT AAAGGATTAA CAGCTGATGG TTTAGCTGGA
CTAAGTGTTA TTGAATTATT AAATCATTTA ATAGAGAATG GAAAAAGTTC AGAAACTTAT
TTAAATAATT TCTTTAAAAA TGGATTACAA GTTAAAGGCT TAGTTCAATA TGCTGGAGAT
TTGAATCCAG AAGCAGAAGA AGTTTTTAAA GAAAATTTTG AAAGAATGTC TAGTGGTTTA
AAAAATGCAC ATAGAATAGC TATGTTACCT ATAGGATATA AATTTGAACC TATAAGTCAA
AAATTAGTTG ATGCACAATT TTTAGAAAAC TCTCAATTAA CAATAAGACA AATTGCTTCA
GTTTTTGGAG TTAAAATGCA CCAATTAAAT GATTTAGATA GAGCAACACA TTCTAACATT
ACAGAGCAAA ACAGAGAATT TTATATTGAT ACATTACAAT CAATATTAAA TATGTACGAG
CTTGAAATTA ATTATAAATT ATTTTTAATC AGCGAAATAA AAAATGGATT TTACTCAAAG
TTTAATGTAG ATACAATTTT GAGAGCTGAT ATAAAAACAA GATATGAAAG TTATAAAGAA
GCTATTCAAA ATGGATTTAA AACTCCTAAT GAAATCAGAG AATTAGAAGA GGATGAACCT
TTAGAAGGTG GAGATGTTCT TTTAATTAAT GGTAATATGA TTCCAGTAAA AATGGCTGGG
GAACAGTATT CGAAAGGGGG TGAAAAATAG
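
The record reports GC content as a percentage of the gene sequence. As a minimal sketch of how such a figure is usually derived (the record does not say how its value was computed or rounded), the hypothetical helper `gc_percent` below counts G and C bases after stripping the spaces and line breaks used in the listing. It is demonstrated only on the first ten-base block of the sequence, not on the full gene.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks used in the
    listing) is ignored. Hypothetical helper, not part of the source record.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100 * gc_count / len(bases)


# First block of the gene sequence above: one G and two C in ten bases.
first_block = "TTGCTTTTCA"
print(gc_percent(first_block))  # 30.0
```

Passing the whole gene sequence, pasted with its spaces and line breaks, to the same function gives a value that can be compared against the reported GC content.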
 
Protein sequence
MLFRKGFKNQ SQEISIDDKK ILEWLGINPS ETYVNGKSCL KQATVFGCIR ILSDNISKLP 
IKIYQKKDGI KRVPDHYLEY LLKLRPNPYM SSSDFWKCIE VQRNIYGNAY VALDFKKNGE
IKGLYPLKSD GMKIFVDDTG LLNSENNVWY LYTDDLGQRH KFMSDEILHF KGLTADGLAG
LSVIELLNHL IENGKSSETY LNNFFKNGLQ VKGLVQYAGD LNPEAEEVFK ENFERMSSGL
KNAHRIAMLP IGYKFEPISQ KLVDAQFLEN SQLTIRQIAS VFGVKMHQLN DLDRATHSNI
TEQNREFYID TLQSILNMYE LEINYKLFLI SEIKNGFYSK FNVDTILRAD IKTRYESYKE
AIQNGFKTPN EIRELEEDEP LEGGDVLLIN GNMIPVKMAG EQYSKGGEK
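
The protein sequence begins with M although the gene sequence begins with TTG, a codon that normally encodes leucine. This follows from translation table 11 (the bacterial, archaeal and plant plastid code), in which TTG is one of the permitted initiation codons and an initiation codon is read as methionine. The sketch below illustrates this on the first and last codons of the gene. The function `translate_cds` and its constants are hypothetical names written for this illustration; the amino-acid assignments are those of the standard code, which table 11 shares.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":    # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten codons of the gene sequence above.
print(translate_cds("TTGCTTTTCA GAAAAGGCTT TAAAAACCAA"))  # MLFRKGFKNQ
```

The result matches the first ten residues of the protein sequence listed above. Applied to the complete gene sequence, the same function stops at the terminal TAG codon.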