Gene CPR_1904 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1904 
Symbol 
ID4204754 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp2102128 
End bp2104308 
Gene Length2181 bp 
Protein Length726 aa 
Translation table11 
GC content30% 
IMG OID642566454 
ProductGTP pyrophosphokinase 
Protein accessionYP_699214 
Protein GI110802270 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG0317] Guanosine polyphosphate pyrophosphohydrolases/synthetases 
TIGRFAM ID[TIGR00691] (p)ppGpp synthetase, RelA/SpoT family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAGAAG AACTAATTTC TAAAATAAAA GCTAATGGGA ATAATGTTGA TATAGATTTA 
GTTAAAAAAG CATATGATTT AGCTTTTGAA GCACATAAAG AACAAAAAAG AGAATCAGGA
GAACCATACA TAACTCATCC TATTAGTGTA GCTATGATAT TAGCTGATAT GGGAATGGAT
ACAAATACTA TAGTTGCAGG ACTTCTTCAT GATGTAATAG AGGATACGGA TTATACTTAT
GAAGATATAA GTAATATTTT TAATGTTGAA GTAGCTAATT TAGTCGATGG AGTTACTAAG
CTTGGAAAAA TAAAATATAA AAGCAAAGAA GAACAACAAG CTGACAATGT AAGAAAAATG
CTTTTAGCAA TGGCTAAAGA CATAAGGGTT ATAATAATAA AACTAGCAGA TAGACTTCAT
AATATGAGAA CACTTAAATA TATGAAGCCT GAAAAACAAA AGAAAAAAGC TCAAGAAACT
TTAGATATAT TTGCACCCTT AGCTCATAGA CTGGGTATAT CTAAAATAAA GTGGGAGTTA
GAAGATTTGT GTTTAAGATA TATTCATCCA GAAGAATATT ATGACTTAGT AAATATGATA
GCTGAAAAAA GGGTGGAGAG AGAAAAATTT ATTTCTCGTA TAATTAAAGA GCTAAAAGAA
AATTTAGATA AAGCTAACAT AGATAGCGAT ATAGAAGGAA GACCAAAACA TTTCTACAGT
ATATATAGAA AAATGGTGAA TAAACATAAG AGCATAGAAC AAATCTTTGA TTTAACAGCT
ATAAGGATTT TAGTTAATAC GGTTAAAGAT TGCTATGCAG TACTAGGTAT AGTACACACT
ATTTATAAGC CAATACCAGG TAGATTTAAA GATTATATAG CAATGCCAAA ACCTAATATG
TATCAATCTT TACATACAAC AGTAATAGGA AGCGAAGGAA AGACTTTTGA AATTCAAATA
AGAACTTTTG AAATGCATAG AACGGCTGAG TACGGAATAG CAGCTCATTG GAAATATAAG
AGTGGTATTA ATGGCACTGA TTCAAAAGAT ATGACTTTTG AAAATAAGTT AACATGGCTT
AGAGATATAC TTGAGTGGCA AAAGGAAGCT GTTGATGCAA CTGAGTTTAT GGAAGGGTTT
AAACTTGACT TATTCTCAGA TGAAATATTT GTATTTACTC CTAAGGGAGT AGTTATAAAT
TTGCCAGTGG GAGCAACTCC CTTAGACTTT GCATATAAGA TCCATACAGA TATAGGAAAT
AAATGCGTAG GAGCTAAAGT AAACGGAAAG ATAGTAACTC TAGATTACAA GCTTAAAACT
GGGGAAATAG TAGAGATATT AACATCCTCA TCATCTAGAG GACCTAATAT AGACTGGTTA
AATATAGCTA ATAGCAATCA AGCTAGAAGT AAAATAAAGC AATGGCTTAG AAAAGCAAGA
AGAGAAGAGA ATTTAGAAAG AGGAAAGGAA ATGCTTGATA AGGAATGTAA AAAGCAATCC
TTAGTATTTT CAGATCTTTG CAAAGGGCCA TTATATGATA AATTATTAAA GAGATATCAT
TTAAATAATG TTGAAGAAAT ATATGTAGCT ATAGGAGAAG GAGAGTTACT TTCATCTACT
GTAATATCTA AGCTTAAAGA GAATATTGTA AAACAGGTTG CTGAAGAGGA ATTAAATAAG
AATATTGAAG AACAAATAGC TAAAACTGAA AGACAAATAA AGAAAAAACA AAACTATGGA
GTAACTGTTA AGGGATTAAA TAATATAATG GTTAGATTTG CAAGGTGTTG TAATCCTGTA
CCTGGAGATG ATATAGCTGG GTATATAACT AAGGGAAGAG GAGTTTCTGT ACATAGAAAA
GACTGTTCTA ATTTTAAAGC TATAGTAGAA AAACAAGAAG AGAAAGTTGT AGATGTTAGT
TGGGGAACTG AAAAGGGAGC TGCATATGTT GCTGAACTTG AGGTTAAAGC AGAAGATAGA
ATGTGTTTAT TATCTGATGT TATGTTAGTT ATAACTGACT CTAATTTTAG ACTACTTTCT
TTAAATGCTA AATCAGGTAG AAATGGAGTA GCAAATATAA ATATTCAAGT AAAGATTGAT
AATATAGAAC AATTAAAAGA ATTAATGAAG AAAATAAGAA GACTACAAGG AATATTAGAT
GTTTATAGAG TAAATAAATA A
 
Protein sequence
MLEELISKIK ANGNNVDIDL VKKAYDLAFE AHKEQKRESG EPYITHPISV AMILADMGMD 
TNTIVAGLLH DVIEDTDYTY EDISNIFNVE VANLVDGVTK LGKIKYKSKE EQQADNVRKM
LLAMAKDIRV IIIKLADRLH NMRTLKYMKP EKQKKKAQET LDIFAPLAHR LGISKIKWEL
EDLCLRYIHP EEYYDLVNMI AEKRVEREKF ISRIIKELKE NLDKANIDSD IEGRPKHFYS
IYRKMVNKHK SIEQIFDLTA IRILVNTVKD CYAVLGIVHT IYKPIPGRFK DYIAMPKPNM
YQSLHTTVIG SEGKTFEIQI RTFEMHRTAE YGIAAHWKYK SGINGTDSKD MTFENKLTWL
RDILEWQKEA VDATEFMEGF KLDLFSDEIF VFTPKGVVIN LPVGATPLDF AYKIHTDIGN
KCVGAKVNGK IVTLDYKLKT GEIVEILTSS SSRGPNIDWL NIANSNQARS KIKQWLRKAR
REENLERGKE MLDKECKKQS LVFSDLCKGP LYDKLLKRYH LNNVEEIYVA IGEGELLSST
VISKLKENIV KQVAEEELNK NIEEQIAKTE RQIKKKQNYG VTVKGLNNIM VRFARCCNPV
PGDDIAGYIT KGRGVSVHRK DCSNFKAIVE KQEEKVVDVS WGTEKGAAYV AELEVKAEDR
MCLLSDVMLV ITDSNFRLLS LNAKSGRNGV ANINIQVKID NIEQLKELMK KIRRLQGILD
VYRVNK