Gene CPF_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2010 
Symbol 
ID4203230 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2251410 
End bp2253125 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content27% 
IMG OID638082879 
Productsensory box histidine kinase 
Protein accessionYP_696443 
Protein GI110801347 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAA GGATTATTAT TTTTACAACA CTAATAATAA CATTTTTTCT AGCAATAATG 
ACTTCTATGT ACTTAGTAAT ATCAAATCAT AAATATTTAG AGGAATCAAA GAATGTTCTA
AATGAATATA ATAAGGTCAT AGCTTTATTG TTAGAAAATG ATAATGGAAA TATTAAGAGT
GAATTAGAAA GAATAGAATC AAATAATGAT ATGAAAAATA TAAGAATAAC ATATATCAGT
AAGGATGGAA ATGTTATTTT TGATACCCAT AAAAAATTAA TCAATGATAA TGAAAGTTAT
CTTAAGAGAC AAGAAATAAT AGAAGCCATA GAAAGTGGTT TAGGAAGCAG TGTAAGATAT
AGTAATGATC TTCATCAAAA CATGATCTAT AGTGCACTTA AACTTAAGGA TGGCTCTATT
GTTAGAACAT CAATAGCTGT TGAAAATGCA AAAATACTAG ATAGCATAAA TAGTAACTAT
TTATTAGTAG GGGTTATATT ATCCTTAGTC ATTGCCTTAC TTTTAACTGT TAAAATAACT
AATATAATAT TAAATCCACT AAAGGAATTA GAACAATTAA CCTCTACTAT TGCAAGTGGT
AATTTTCATA AAAGAGTAAA AATTAATTCT AAAGATGATG AAATTCAAAG ACTAGGAAAA
AGCTTTAATT ATATGGCAGA GCAATTAGAA ATAACCATGG AGAGATTTAA AGATAAACAA
AATGGATTAG AAGCCATATT AAAAAGTATG GGTAGTGGAG TAATAGCTTT TGACAGAGAT
ATGAATGTTT TAATGATAAA TCCTTATGCT AAAAAAATAT TTGGCATAAG CGGAGAGATT
ATTGGAAATA AACTTTTGGA TTATATAACT GATAAAGAGG TACTAAAGGC CTTTTTTGAT
GAAAAAGATA GGGTTGAAAT TGAAGTTAAC TATAATGATG ATCCAAAAAT ATTAAAAATA
AGAAAAGCAA GTATAATAAA TGAACCAGAA ATAATAGGGA CAGTTGTGGT TATACAAGAT
ATTACAGATA TTAAAAAGCT TGAAAACATG AGAAGTCAAT TCGTAGCTAA TATATCTCAT
GAACTTAAGA CCCCACTTAC ATCAATTAAA GGTTTTGCAG AAACCTTAAG ATATGTAGAT
GATGATGAAA CTAGAAATAA ATTTTTAAGC ATAATAGATG AAGAATCAGA TAGATTAGCA
AGACTTTTAG AGGATATATT ATGTCTTTAT GAAATAGAAC AAAAAAGAAG TACTGTTTTA
GAAGAATTTA ATGTTGATGA AGAAATTGAA AAAGTTTATA TGCTATTAAA TGATCAAGCT
AAGAAAAAAG GTGTGGAAAT ATTTTTAGAT ACAAATAGCA ATTGTGTTCT TATGGGAGAT
AAGGATAAGT TTAAACAAAT GTTACTTAAT CTTGTAAGCA ATTCTGTTAA ATATACTGAA
AAAGGTGGAA AGGTAAGAGT TGAAAGTTAT AATCGTGACA TGAATCTTGT TTTAGTTATT
GAGGATAATG GAATTGGAAT AAGTGCAGAG GATCTTCCAA GGATATTTGA AAGATTTTAT
AGAGTAGATA AAGCTAGAAG TAGAGAAAGT GGTGGAACTG GACTAGGTCT TGCCATAGTT
AAACATATAG TTAGACTTTT TGATGGTGAG ATAAATGTAA CTAGTGAACT AGGAGTAGGA
ACTAAAATAG TAATAACTAT ACCTATAAAT ATATAA
 
Protein sequence
MKKRIIIFTT LIITFFLAIM TSMYLVISNH KYLEESKNVL NEYNKVIALL LENDNGNIKS 
ELERIESNND MKNIRITYIS KDGNVIFDTH KKLINDNESY LKRQEIIEAI ESGLGSSVRY
SNDLHQNMIY SALKLKDGSI VRTSIAVENA KILDSINSNY LLVGVILSLV IALLLTVKIT
NIILNPLKEL EQLTSTIASG NFHKRVKINS KDDEIQRLGK SFNYMAEQLE ITMERFKDKQ
NGLEAILKSM GSGVIAFDRD MNVLMINPYA KKIFGISGEI IGNKLLDYIT DKEVLKAFFD
EKDRVEIEVN YNDDPKILKI RKASIINEPE IIGTVVVIQD ITDIKKLENM RSQFVANISH
ELKTPLTSIK GFAETLRYVD DDETRNKFLS IIDEESDRLA RLLEDILCLY EIEQKRSTVL
EEFNVDEEIE KVYMLLNDQA KKKGVEIFLD TNSNCVLMGD KDKFKQMLLN LVSNSVKYTE
KGGKVRVESY NRDMNLVLVI EDNGIGISAE DLPRIFERFY RVDKARSRES GGTGLGLAIV
KHIVRLFDGE INVTSELGVG TKIVITIPIN I