Gene CPR_2027 details


Gene Information

Locus tag: CPR_2027
Symbol:
ID: 4205970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2234888
End bp: 2236537
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 31%
IMG OID: 642566577
Product: putative manganese-dependent inorganic pyrophosphatase
Protein accession: YP_699336
Protein GI: 110803953
COG category: [C] Energy production and conversion
COG ID: [COG1227] Inorganic pyrophosphatase/exopolyphosphatase
TIGRFAM ID:
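
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2234888
END_BP = 2236537

def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_length_bp):
    """Residue count, assuming the last codon is an untranslated stop codon."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1

computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)
print(computed_gene_length, "bp;", computed_protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1650 bp) and Protein Length (549 aa) fields listed in the record.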


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGATG TTATATACAT TACAGGACAT AAAAATCCTG ATTCAGATTC AATATGTGCT 
GCTATAGCAT ATGCTGAATT TAAAAATAAA ACTCAAGATA CTCTTGCTAT ACCAGTTAGA
TTAGGAAATG TTAGCCAAGA GACTCAATAC ATACTTGATT ATTTTGGTGT AGAAGCACCT
CAATTTTTAG AAACAGTTAA ACTAAAAGTT GAAGACTTAG AAATGGATAA CATAGCTCCA
TTAGCTCCAG AAGTTTCATT AAAAATGGCT TGGAACATAA TGAGAGATAA AAACTTAAAA
TCAATACCTG TTGCTGATGG AAATAACCAT TTACTTGGAA TGTTATCAAC ATCAAACATA
ACAGAAACAT ACATGGATGT TTGGGATAGT AATATTCTTG CTAAAAGTTC TACTTCATTA
GACAATATAT TAGATACATT GTCTGCTGAG GCTCAAAACA TAAATGAAGA AAGAAAAGTA
TTCCCAGGAA AAGTTGTAGT TGCTGCTATG CAAGTTGAAT CTTTAAAAGA ATTCATAAGC
GAAGGCGACA TAGCTATAGC TGGAGATAGA GCTGAAATAC AAGCTGAATT AATAGAGTTA
AAAGTTTCTC TATTAATAGT AACAGGTGGA CATACACCTT CTAAAGAAAT AATAGAACTT
GCTAAGAAAA ATAATATAAC TGTTATTACA ACACCTCACG ATTCATTTAC TGCATCAAGA
TTAATCGTTC AAAGTTTACC AGTTGACTAT ATTATGACTA AAGATAACTT AGTAACTGTA
TCTACTGATG ATTTAGTTGA AGATGTAAAA GTAATAATGA GTGAAACTAG ATACAGTAAT
TATCCTGTAA TAGATGAGAA TAATAAAGTA GTTGGTTCAA TTGCGAGATT CCACTTAATT
TCTACTCATA AGAAAAAAGT TATTCAAGTT GATCACAACG AAAGAGGTCA ATCAGTACAT
GGTTTAGAAG ATGCTGAAGT ATTAGAAATA ATCGACCATC ATAGAGTTGC AGATATACAA
ACTAGTAATC CTATATACTT TAGAAATGAA CCTCTTGGAA GTACTTCAAC AATAGTTGCT
AAAAGATTCT TCGAAAATGG AATAAGACCA TCAAGAGAAG CTGCTGGACT TTTATGTGGT
GCTATAATTT CAGATACATT ATTATTCAAA TCACCTACTT GTACTCCACA AGACATAAAA
ATTTGCAGAA GACTTGCTGA AATAGCTGGA ATAGTTCCTG AAACATTTGC TAAAGAAATG
TTTAGAGCCG GAACTTCCTT AAAAGGAAAA TCTATTGATG AAATATTTAA CTCCGATTTT
AAACCTTTCA CAATTGGAGG TATTAAAATA GGTGTTGCTC AAGTTAACAC TATGGATATA
GAAGGATTTA TGCCTTTAAA AGATGAAATG TTAGATTATA TGAATCAAAA GGCTGATTCA
ATGGGATTAG AAATGATTAT GTTATTACTT ACTGACATAA TAAATGAAGG TTCACAAATA
CTAGTTACAG GAAGAAATCC AGAAATAGCT GAAGAGGCTT TTAAAGTTAA ATTAGAAGAT
TCAACTACTT TCTTACCTGG AGTCCTTTCA AGAAAGAAAC AAGTTGTACC TCCATTAACT
CAAATAATTA CAACAAGAGT TAGTAAATAA
 
Protein sequence
MKDVIYITGH KNPDSDSICA AIAYAEFKNK TQDTLAIPVR LGNVSQETQY ILDYFGVEAP 
QFLETVKLKV EDLEMDNIAP LAPEVSLKMA WNIMRDKNLK SIPVADGNNH LLGMLSTSNI
TETYMDVWDS NILAKSSTSL DNILDTLSAE AQNINEERKV FPGKVVVAAM QVESLKEFIS
EGDIAIAGDR AEIQAELIEL KVSLLIVTGG HTPSKEIIEL AKKNNITVIT TPHDSFTASR
LIVQSLPVDY IMTKDNLVTV STDDLVEDVK VIMSETRYSN YPVIDENNKV VGSIARFHLI
STHKKKVIQV DHNERGQSVH GLEDAEVLEI IDHHRVADIQ TSNPIYFRNE PLGSTSTIVA
KRFFENGIRP SREAAGLLCG AIISDTLLFK SPTCTPQDIK ICRRLAEIAG IVPETFAKEM
FRAGTSLKGK SIDEIFNSDF KPFTIGGIKI GVAQVNTMDI EGFMPLKDEM LDYMNQKADS
MGLEMIMLLL TDIINEGSQI LVTGRNPEIA EEAFKVKLED STTFLPGVLS RKKQVVPPLT
QIITTRVSK
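
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch, assuming plain Python with no external libraries; `clean_sequence` and `gc_fraction` are hypothetical helper names. It is demonstrated on the first line of the gene sequence only, so the resulting GC fraction is not expected to equal the whole-gene value reported in the record.

```python
def clean_sequence(blocked):
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(blocked.split()).upper()

def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)

# First line of the gene sequence, copied from this record.
first_line = "ATGAAGGATG TTATATACAT TACAGGACAT AAAAATCCTG ATTCAGATTC AATATGTGCT"

seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Applying the same two helpers to the full gene sequence would give a whole-gene figure that can be compared against the GC content field.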