Gene CPR_1728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1728 
Symbol 
ID4204037 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1925398 
End bp1927113 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content27% 
IMG OID642566278 
Productsensory box histidine kinase 
Protein accessionYP_699043 
Protein GI110801444 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00635742 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA GGATTATTAT TTTCACAACA CTAATAATAA CATTTTTTCT AGCTATAATG 
ACTTCTATGT ACTTAGTAAT ATCAAATAAT AAATATTTAG AGGAATCAAA GAATATCCTA
AATGAATATA ATAAGGTCAT AGCTTTATTG TTAGAAAATG ATAATGGAAA TATAAAGAGT
GAATTAGAAA GAATAGAATC AAATAATGAT ATGAAAAATA TAAGAATAAC ATATATCAGT
AAGGATGGAA ATCTTATTTT TGATACTCAT AAAAAATTAA TAAGTGATAA TGAAAGTTAT
CTTAAGAGAC AAGAAATAAT AGAAGCCATA GAAAGTGGTT TAGGAAGCAG TGTAAGATAT
AGTAATGATC TCCATCAAAA CATGATCTAT AGTGCACTAA AACTTAAGGA TGGCTCTATT
GTTAGAACAT CAATAGCTGT TGAAAATGCA AAAATATTAG ATAGCATAAA TAGTAACTAT
TTATTAGTAG GTGTTATATT TTCCTTAGTA ATTGCCTTAC TTTTAACTGT TAAAATAACT
AATATAATAT TAAATCCACT AAAGGAGTTA GAACAATTAA CCTCTACTAT TGCAAGTGGT
AATTTTCATA AAAGAGTAAA AATTAATTCT AAGGATGATG AAATTCAAAG ACTAGGAAAA
AGCTTTAATT ATATGGCAGA GCAATTAGAA ATAACCATGG AGAGGTTTAA GAATAAACAA
AATGGATTAG AAGCCATATT AAAAAGTATG GGTAGTGGAG TAATAGCTTT TGACAGGGAT
ATGAATGTTT TAATGATAAA TCCTTATGCT AAAAAAATAT TTGGCATAAG CGGAGAGATT
ATTGGAAATA AACTTTTGGA TTATATAACT GATAAAGAGG TATTAAAGGC CTTTTTTGAT
GAAAAAGATA GGGTTGAAAT TGAAGTTAAC TATAATGATG ATCCGAAAAT ATTAAAAATA
AGAAAAGCAA GTATAATAAA TGAACCAGAA ATAATAGGGA CAGTTGTGGT TATACAAGAT
ATTACAGATA TTAAAAAACT TGAAAATATG AGAAGTCAAT TTGTAGCTGA TATATCTCAT
GAACTTAAGA CACCACTTAC CTCAATTAAA GGTTTTGCAG AAACCTTAAG ATATGTAGAT
GATGATGAAA CTAGAAATAA ATTTTTAAGC ATAATAGATG AAGAATCAGA TAGATTAGCA
AGACTTTTAG AGGATATATT ATGTCTTTAT GAAATAGAAC AAAAAAGAAG TACTGTTTTA
GAAGAATTTA ATGTTGATGA AGAAATTGAA AAAGTTTATA TGCTATTAAA TGATCAAGCT
AAGAAAAAGG GTGTGGAAAT ATTTTTAGAT ACGCATAGCA ATTGTGTTCT TATGGGAGAT
AAGGATAAGT TTAAACAAAT GTTACTTAAT CTTGTAAGCA ATTCTGTTAA ATATACTGAA
AAAGGTGGAA AGGTAAGAGT TGAAAGTTAT AATCGTGACA TGAATCTTAT TTTAGTTATT
GAAGATAATG GTATTGGAAT AAGTGCAGAG GATCTTCCAA GGATATTTGA AAGATTTTAT
AGAGTAGATA AAGCTAGAAG TAGAGAAAGT GGTGGAACTG GACTAGGTCT TGCCATAGTT
AAACATATAG TTAGACTTTT TGATGGTGAG ATAAATGTAA CTAGTGAACT AGGAGTAGGG
ACTAAAATAG TAATAACTAT ACCTATAAAC ATATAA
 
Protein sequence
MKKRIIIFTT LIITFFLAIM TSMYLVISNN KYLEESKNIL NEYNKVIALL LENDNGNIKS 
ELERIESNND MKNIRITYIS KDGNLIFDTH KKLISDNESY LKRQEIIEAI ESGLGSSVRY
SNDLHQNMIY SALKLKDGSI VRTSIAVENA KILDSINSNY LLVGVIFSLV IALLLTVKIT
NIILNPLKEL EQLTSTIASG NFHKRVKINS KDDEIQRLGK SFNYMAEQLE ITMERFKNKQ
NGLEAILKSM GSGVIAFDRD MNVLMINPYA KKIFGISGEI IGNKLLDYIT DKEVLKAFFD
EKDRVEIEVN YNDDPKILKI RKASIINEPE IIGTVVVIQD ITDIKKLENM RSQFVADISH
ELKTPLTSIK GFAETLRYVD DDETRNKFLS IIDEESDRLA RLLEDILCLY EIEQKRSTVL
EEFNVDEEIE KVYMLLNDQA KKKGVEIFLD THSNCVLMGD KDKFKQMLLN LVSNSVKYTE
KGGKVRVESY NRDMNLILVI EDNGIGISAE DLPRIFERFY RVDKARSRES GGTGLGLAIV
KHIVRLFDGE INVTSELGVG TKIVITIPIN I