Gene CPF_1523 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1523 
Symbol 
ID4202136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1739951 
End bp1742314 
Gene Length2364 bp 
Protein Length787 aa 
Translation table11 
GC content23% 
IMG OID638082401 
Productsensory box histidine kinase 
Protein accessionYP_695966 
Protein GI110798905 
COG category[T] Signal transduction mechanisms 
COG ID[COG5002] Signal transduction histidine kinase 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00153524 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAATTG GACTTTTTAA CAATTTAGGT ATTAAAGAGA AAACATTAAA AAGAATAGCA 
ATAGTCACAG TAATATTACT TAGTTTATTT TTTATAAAAG GGTTAAATTT AATTTGTGCC
TTAAATAGAG ATTTGTTTAA GGATGTAAGT GTAGTTTATG CTGCTGAAGA AATTATAGTA
ATGACTCAAA TAATTCTTTG TATTATGATA TTATCTATAT GCTTTATATA TTATAGAGGA
CTTAAACGAA AAGAGTTCTT TGGTATATCC CTTGTTTATG TAAGTATTAT TACTGAAATG
ATTTTTATAG TCTTAACAGG AAAAATAGTT GAAAATCAGG TTTATGACTT GAATTTATTT
AGTTTGTTAT TTAGAGGGAT ATTACTACTT TTAGCTGTTT TATGCCTTGA AAATTTTGAT
GCTTTTATAA GGAAATATAA AAAATTAACC TTAGTATTTG TAATATTGGT AACGATTATA
TTACAAATAT TAAATGTTAA TTATAACATA GGAATATATA CATATAATTT TATATATAAT
TTAACTTTAG TTTTAATAGT TTTAACATAT ATATCTTGTA TAATATTTTG TTTAGTTAGA
ATTTTTCAAT TTAGAGAAAT AACTTATTTA GTAATAATGA TAAGTGCTTC ACTTATGTTA
TTAAAATTAG TTTATGGCCT TTCTTTAAGT ATAACTAATA ATAATCTTAT AAAAATAAAC
ATTGTATTTT TTAATTTTAT ATCATTTATG AGCTTTGTAT TTGGTATGTT TTTTGATTTA
CTTCAAGTTA TAAAAAATAA AAATTTTATG CAAGAAGAAC TAAGTGCATT TTTTAACCTA
ATAGAATTTG ATTGTAATAG TGAAGTTGTA GTACTTAGTA ACAATTTAAA AGTTCTTTAT
GCAAATGAAA AATGTAGAAG TAAGAGAATA TCACCTAAAA ATAGAGAAAA TAAAACTTAT
ATTGATTTAG AAAAACAAAT TAAAGGATTT TTATATGATA AAAATATAAT TTGTATAGAA
AGTGTGCTTA GAAATTCTAA GGAATGGAAA GGTATAATAA AGTTAAATGA AGAGGATGAA
GTTGTAAAAG TAAATCTTCA GAGAATAAAA AAAGAAAAAA ATCTTTATTA TGTACTTAGA
ATTAATGATA TAACAGAAGA ATACAAAATG GAGAAAAATT TAAAATTAGA GGAGCAAAGG
CTTAGAGGAG TTACTGAAAA TATAAAAGAT TTAATATTTA CAATTGATGT TGAAGGAAAA
ATAAGCTATG TAAATAAAGC AGTAATAGAT GTTTTAGGAT ATAGTGAAGA AGAATTAATA
GGAAAAAATT ATTATGATTT ATTATTAGTT GAATCTAATT TAAATATAAT AGACAGCAAA
TATTTTAATG AAGATAAGAT TTTAACAATA GATAAGGTAA GATCTAAAAA AGGGTTGGTT
CAATTAGAAT CCATTTCTAG TAGAATTAAG GATAATAAAA ATAATACCTT AGGATGGGTA
AGAGTTGCTA GAAATATAGA GGATGTAAGA GAAATAGAAA TATTAAAGAA TAAATTTGAA
GAAATAAAGC AATATGACAA GGTTAGAAGT GAGTTTTTTG CAAATTTATC TCATGAGCTT
AGAACTCCTA TTAACATAAT ATATTCATGC ATACAGCTTT TAAACACTAG TAAAAAGAAT
AAGGCAAACT TTGCTAATTT ATATGATAAG TATGAAAAAA CTTTAAAACA AAATTGTTTT
AGGATGTTAA GGCTTGTAAA TAATCTTATA GATATAACAA AAATAGACTC TGGTTTTATT
AAGATGGACT TTATTAATTA TGACATAATA AAGCTTACAG AAGATATAAC TATGTCTGTA
ATTCCTTATG TAGAATCTAA GAATATAGAT ATAATCTTTG ATACTAATTG TGAGGAATTA
GAGATAAGAT GTGACCCAGA TAAAATTGAG AGAATAATTT TAAATTTATT ATCTAATGCC
ATAAAATTTA CAGAGCCAGG TGGAAAAATA GAAGTTAGTA TTTTTGCAGA TGAAACTTGG
GTAGATATAA GGGTTAAGGA TACAGGTATA GGAATTCCAT CACACATGAA AGAATTTATT
TTTGAAAGAT TTATACAAAA TGATAAATCC TTAAATAGAA ATAAAGAAGG AAGTGGAATA
GGATTATCCT TGGTTAAATC CTTAGTGGAA TTACATGAAG GAAAAGTTTT CTTAAGAGAA
AGTAATGAAT CAGGTAGTGA ATTTTCAATA TTACTACCTA ATGTGAAATT GGAGAATGAT
GTTTGCGAAA ATGGAAGCTT AGATTATAAA ACAGAGGTTG AAAAAATATC AATAGAGTTT
GCTGATATTT ATGAAATATA TTAG
 
Protein sequence
MLIGLFNNLG IKEKTLKRIA IVTVILLSLF FIKGLNLICA LNRDLFKDVS VVYAAEEIIV 
MTQIILCIMI LSICFIYYRG LKRKEFFGIS LVYVSIITEM IFIVLTGKIV ENQVYDLNLF
SLLFRGILLL LAVLCLENFD AFIRKYKKLT LVFVILVTII LQILNVNYNI GIYTYNFIYN
LTLVLIVLTY ISCIIFCLVR IFQFREITYL VIMISASLML LKLVYGLSLS ITNNNLIKIN
IVFFNFISFM SFVFGMFFDL LQVIKNKNFM QEELSAFFNL IEFDCNSEVV VLSNNLKVLY
ANEKCRSKRI SPKNRENKTY IDLEKQIKGF LYDKNIICIE SVLRNSKEWK GIIKLNEEDE
VVKVNLQRIK KEKNLYYVLR INDITEEYKM EKNLKLEEQR LRGVTENIKD LIFTIDVEGK
ISYVNKAVID VLGYSEEELI GKNYYDLLLV ESNLNIIDSK YFNEDKILTI DKVRSKKGLV
QLESISSRIK DNKNNTLGWV RVARNIEDVR EIEILKNKFE EIKQYDKVRS EFFANLSHEL
RTPINIIYSC IQLLNTSKKN KANFANLYDK YEKTLKQNCF RMLRLVNNLI DITKIDSGFI
KMDFINYDII KLTEDITMSV IPYVESKNID IIFDTNCEEL EIRCDPDKIE RIILNLLSNA
IKFTEPGGKI EVSIFADETW VDIRVKDTGI GIPSHMKEFI FERFIQNDKS LNRNKEGSGI
GLSLVKSLVE LHEGKVFLRE SNESGSEFSI LLPNVKLEND VCENGSLDYK TEVEKISIEF
ADIYEIY