Gene CPF_2667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2667 
Symbol 
ID4202205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp2940556 
End bp2942562 
Gene Length2007 bp 
Protein Length668 aa 
Translation table11 
GC content29% 
IMG OID638083533 
Productsigma-54 dependent transcriptional regulator/sensory box protein 
Protein accessionYP_697047 
Protein GI110798683 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG1925] Phosphotransferase system, HPr-related proteins
[COG3829] Transcriptional regulator containing PAS, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR00229] PAS domain S-box
[TIGR01003] Phosphotransferase System HPr (HPr) Family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATTCAA GAGAATTGAT TATAAAGACG CCCTTAGGAT TACACACTAG ATACTCTGCT 
ATGATTGTTA ATAAGGCTTC TGAAATAGAG TCTAAGTATA AAGTTAAACT CTATATAAAA
AAAGAATCCT ATACAGATTG GCTTGGAATA AGTATGCTTG CAATACTTTC TCTTAAGGTG
TTGCCTAATG AAAGTATATT AATTGGATCT AAAAATGAAG GAATGATTGA AAAGTTAGCT
GTTCAATCTC TTCTTGAGTT TATTGATAAA AACATAAATA ATCCCATTGA ATCTGAGGAT
AGTGAAATAG ATGAAATAAT AGATGCTAGC ATAGTAGCTA ATGAACAGGT TCTTGAAAGT
TTGCCAATAG GTATTGTTGT TATAGATATA GATCAAAATA TAATAACAAT TAATCAATAT
GCTTTAAGAT TTATTGGATG CAATAAAAAA GATGTTAAAG GAAAAAAGAT TAATGAAGTT
ATTCCATCAT CTCAGCTTCC TCATGTTATG CTTAGTAATT TAAAAAAGTA TGGCTCAACT
CTTCATATAA ATAACAGAGT TGGCTTAGTA AATAGCTCTC CTCTTTTCAT AAACGATAAG
ATTATAGGGG CTGTAAGTGT TATTCAAGAT GTATCTGATA TTATAGGTAT GAAAGAAATA
AATGAGAAAT TTACTAAGAT ACTTGAAAAT TCACAGGATA TGATTTGCTT TGTAGATGAA
AATGGTATAA TAAATTATTT AAATCCTGCA TATATTAAAA ACTTTTCTAA AGTTAGTTCT
GATGTTATAG GAAAAAGTAT TTTTGATATT GCTCCTAATG GACTTAGAGC TAAAGTATTT
AAAGAAAAAA CTTTATTAAA AGATGTAATT CATAAGAAAA ACGGTATAAA TGTTATAAGT
ACTATTGATC CTTTATTTAT TGATGGACAG TTTAAAGGAG TTATTTCTAC TTCTAGACCA
GTTAGTTTAA TTAAAGAACT TATGTCAAAA CTAAATAAAT CTGAACAGGA ATTAGATTAT
TATAAAAATG AGTTTTTAAG ACAGTTATCT AAAAATTCTT CTTTTAATAA TATTATAGGT
TCAACTAGAA CTCTAAAGGA TATAATGTAC ATGTGTCAAA AGGCTTCAGA AACCACATCA
ACTGTTCTTA TTCGTGGTGA AAGTGGTACA GGTAAGGAAC TTATTGCAAA GGCTATACAC
AATAATAGTA ATAGAAAAAA TAAGCCTTTC GTAAGGGTTA ACTGTGCTTC AATTCCTGAA
AATCTCTTAG AAAGTGAATT GTTCGGATAT GAAAAGGGAG CTTTTACTGG GGCAGTTCAA
AGCAAACCTG GTAAATTTGC TATTGCTGAT ACAGGTACTA TATTCTTAGA TGAAATAGGT
GATATGCCCT TATCAATGCA AGTTAAACTC CTTAGAGTTT TACAAGAAAG AGAGATTGAA
AGTGTTGGAG GAATAACTCC TAGAAATATT GATGTGAGGG TTATTGCTGC AACTAATAGA
AATTTAGAAG AAATGATTGA AGAGGGTTCC TTTAGAGAGG ATCTTTATTA CAGACTTAAT
GTCTTAGGTA TTAATCTTCC TCCACTTAGA GAACGTAAAG AAGATATTCC TGAACTAGCC
GAACATTTTA TAACTAAGCT AAATAAAAAG TTACATAAAA CTATATTAGG AATAAAACAA
GATGCCCTTA ATCTACTAAT TGAGTATTCT TGGCCTGGAA ATATACGTGA GCTTGAAAAT
ATAATGGAAA GGGCCATAAA TCTTTGTGAT GGAGATTATA TAGATAGTTC TTATCTGCCT
TCATATTTAA AGCCAGTTGA ATCAAAGTCT TTTAATTTAA ATATTGATAT TGATCATATA
CTTCCCTTTG AAGAATATGA AAAACAAATA ATAGAAGCAG CTATGAAGAA GTACAAATCC
TTTAATAAGG CAGGAAAAGC TTTAGGATTA ACTCATAGAA CAGTTTCTTT GAAATGTAAA
AAGTATAATA TAGATGTAAA AAAATAA
 
Protein sequence
MYSRELIIKT PLGLHTRYSA MIVNKASEIE SKYKVKLYIK KESYTDWLGI SMLAILSLKV 
LPNESILIGS KNEGMIEKLA VQSLLEFIDK NINNPIESED SEIDEIIDAS IVANEQVLES
LPIGIVVIDI DQNIITINQY ALRFIGCNKK DVKGKKINEV IPSSQLPHVM LSNLKKYGST
LHINNRVGLV NSSPLFINDK IIGAVSVIQD VSDIIGMKEI NEKFTKILEN SQDMICFVDE
NGIINYLNPA YIKNFSKVSS DVIGKSIFDI APNGLRAKVF KEKTLLKDVI HKKNGINVIS
TIDPLFIDGQ FKGVISTSRP VSLIKELMSK LNKSEQELDY YKNEFLRQLS KNSSFNNIIG
STRTLKDIMY MCQKASETTS TVLIRGESGT GKELIAKAIH NNSNRKNKPF VRVNCASIPE
NLLESELFGY EKGAFTGAVQ SKPGKFAIAD TGTIFLDEIG DMPLSMQVKL LRVLQEREIE
SVGGITPRNI DVRVIAATNR NLEEMIEEGS FREDLYYRLN VLGINLPPLR ERKEDIPELA
EHFITKLNKK LHKTILGIKQ DALNLLIEYS WPGNIRELEN IMERAINLCD GDYIDSSYLP
SYLKPVESKS FNLNIDIDHI LPFEEYEKQI IEAAMKKYKS FNKAGKALGL THRTVSLKCK
KYNIDVKK