Gene CPR_2072 details


Gene Information

Locus tag: CPR_2072
Symbol:
ID: 4203974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2296371
End bp: 2297642
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 33%
IMG OID: 642566622
Product: hypothetical protein
Protein accession: YP_699381
Protein GI: 258676983
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
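
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. Both assumptions are consistent with the listed values but are not stated in the record itself, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2296371, 2297642)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both results match the Gene Length and Protein Length fields of the record.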


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGACTG ATTGGTCATC AATAACTGGT AAAAGAGAAG ATATGGAAGA GTTAGATCCA 
GCAGAGAGAA TAAAATCATT TGATGAAGTT GCTTTAGGAT ATACTAAAGA GGAAGCTTTA
AGAGAAGCAG ATAGATGTAG ACAATGTCAA TGTAAGCTTT GTATGAAGGA ATGTATAATG
CTTAATGACT ATACAGATTG TCCAAAAGCT TTATTCAGAG AATATCTAGA AAAAGGTTAT
GAAAACATGG ATAAGATGAT AGCATATTCA TGTAATGAAT GTAATCAATG TACATTAAAA
TGTCCAAAAG AGTTAGATTT AAAAGTTAAC TTTAGAGCTA TGAAAGAAGC TTTTTCTGAA
GAAAATGGTG GGTTAGCACC ATTAGAAGCA TTAAAAGCAA GTGATGCTAC TCAAGAAAAA
GAGTGTGCAG AAGAATACTG TACAACAGTT GAAGCAGCCT CAGTTGAGGA AGTGAAAGAA
AAGAAAAAAG CTAAGAAAAA AACCAAGTAT GTATTTGTAC CAGGATGTAC AGTACCAGCT
TATACTCCAG TGGGAGTAGA AAGTGTTTTA AGACACTTAA AAGATTCTTT AGGAGATGAA
AATGTTGGTG CATTACTTCA ATGTTGCGGT AAAGTAACTT ACTTAATCGG AGAAGAAGAA
AAATACGAAG AAAGAAATAA AAAGGCTATA GATATATTAG ATGAGATGGG AGCAGAAGTT
ATAATAACTG TTTGTCCATC ATGTTATAAA GTATTTAAAG AAACAGCTAA GAATCAAAGA
GTTATAGCAT ACTGGGATTT AATGAAATAT TTAATAGGAA TTCCAGCAGA GTCTAAAGGA
ATAGGAGAAG GTTCAGACGT TATATTTAAC ATACATGATT CATGTGTAAC TAGAGATGTA
ACTTCACATC ATGAAAGTGT AAGATGGATC TTAGACCAAT TAGGATATAA TTGGGAAGAA
GTTGAAAGAA ACGGTAAAAA CACTAGATGT TGTGGTGTTG GAGGAATGGT ATGTAGCTCA
AACCCAGAAT TATACGAGAG AGTATACACT AGAAGAGCTA ATGATTTTAA CCAAGACAAC
ATAGTAACTT ACTGTGGTTC ATGTAGAGGA ACTATGCAAG CTTCTGGCAA AGATGCAGTT
CATATATTAG ATCTTATCTT TGGATCAAAA TATACTAAAG ATCAAGCTCA GCAAAGAGGA
TATAGGACAG AAGAAGAAAT GTGGGCTAAT AGATTAGAAA CTAAAGAAAG ACTAAATAAA
TTTAAAAAGT AG
 
Protein sequence
MPTDWSSITG KREDMEELDP AERIKSFDEV ALGYTKEEAL READRCRQCQ CKLCMKECIM 
LNDYTDCPKA LFREYLEKGY ENMDKMIAYS CNECNQCTLK CPKELDLKVN FRAMKEAFSE
ENGGLAPLEA LKASDATQEK ECAEEYCTTV EAASVEEVKE KKKAKKKTKY VFVPGCTVPA
YTPVGVESVL RHLKDSLGDE NVGALLQCCG KVTYLIGEEE KYEERNKKAI DILDEMGAEV
IITVCPSCYK VFKETAKNQR VIAYWDLMKY LIGIPAESKG IGEGSDVIFN IHDSCVTRDV
TSHHESVRWI LDQLGYNWEE VERNGKNTRC CGVGGMVCSS NPELYERVYT RRANDFNQDN
IVTYCGSCRG TMQASGKDAV HILDLIFGSK YTKDQAQQRG YRTEEEMWAN RLETKERLNK
FKK
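
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any calculation. The sketch below shows one way to do that and to compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the stop-codon set assumes translation table 11 uses the standard stops TAA, TAG, and TGA.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumed stop codons for translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a stop codon."""
    return seq[-3:] in STOP_CODONS


# Short excerpts from the gene sequence: the first block and the last two blocks.
first_block = clean_sequence("ATGCCGACTG")
tail = clean_sequence("TTTAAAAAGT AG")
print(gc_percent(first_block))  # 60.0
print(ends_with_stop(tail))     # True
```

Passing the full gene sequence to `clean_sequence` and then `gc_percent` gives a value that can be compared with the reported GC content.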