Gene CPF_2055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2055 
Symbol 
ID4202265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2295113 
End bp2296093 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content31% 
IMG OID638082920 
Producthypothetical protein 
Protein accessionYP_696484 
Protein GI110799015 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0543] 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.690282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTATG AAGTAATTGA TTGTATAGAT GCTGGTACGG AATTTTGTCC TTGTCACTTA 
GCAGAAGAAG GAGAATGCAT ACTTTGTTCT CAACTTCACG GAAAGTGTTT TTGTGATTGT
GTAAACTGGA AAGGGGTTTG CATATATGAA GAATTTGCAA GTAATGGATT TAAAGCTAAG
GAAGGCAGAA AAACTTTTAC TTGTGATGTC ATAGATGCTG TAGAAGTTGA AGAAGGATTA
TTATTTATAG AGTTTAGGGC ACCTCATAAA CTTTGTATTG ATTTACTTGG ACCAGGTAAG
TTTATATTCA TAAGAACTAA TGATAATCCG TTTTTTGATG TACCTATATC TATTTTAGAA
TCTGATGCAG ATAAAAATAT AATAAAAGTA CTTATTGAAG TTAGAGGAAT AAAAACTAAG
AGGCTCTTAA ATACTGAGGT CAAAGGAGAA ATAACTATAA GAGGGCCTTA TTTTAATGGA
GTGTTTGGAA TAAAAAATAT AGACTCAACT AAGAATGGAG AGGTCCTTGT ACTTTGCAGA
GGAATTGGAT TAGCTCCTGC TGTACCAGTT ATTAAAAAAT TAGCTAATGA AGGAAATAAG
GTTAAGATTC TTTTAGATAA GGCACCTTTT AAAGAAAGTT ATATAGAAAA ATATTTAGAA
GGATATGATG TTCAGTATAT TCCTATGAAT TTAATCAATA AAGGTGAAAT ATCAAATGAA
GCAAAGGAAG TTATTAAAAA TGGACAATGG GATTTAATCC ATTGTGCTGG AGCGGATATA
TTAACTTATA AACTTATAGA ATATTTAAAT GATTTGAAAG ATGAATCTAC AAAAGTATCT
TGTTGTAATA ATGCTAAAAT GTGCTGCGGA GAAGGGGTAT GTGGAAGCTG TACAGCTAGG
TTTGCTGGTC ACAAGGTAAA AAGACTTTGT AAGGTGCAAA GTGATCCTAG AAAAATATTT
GAAGATAGAA GATTCATTTA A
 
Protein sequence
MDYEVIDCID AGTEFCPCHL AEEGECILCS QLHGKCFCDC VNWKGVCIYE EFASNGFKAK 
EGRKTFTCDV IDAVEVEEGL LFIEFRAPHK LCIDLLGPGK FIFIRTNDNP FFDVPISILE
SDADKNIIKV LIEVRGIKTK RLLNTEVKGE ITIRGPYFNG VFGIKNIDST KNGEVLVLCR
GIGLAPAVPV IKKLANEGNK VKILLDKAPF KESYIEKYLE GYDVQYIPMN LINKGEISNE
AKEVIKNGQW DLIHCAGADI LTYKLIEYLN DLKDESTKVS CCNNAKMCCG EGVCGSCTAR
FAGHKVKRLC KVQSDPRKIF EDRRFI