Gene CPF_2822 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_2822 
Symbol 
ID4201689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp3083918 
End bp3085132 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content29% 
IMG OID638083689 
Productheme/steroid binding domain-containing protein 
Protein accessionYP_697186 
Protein GI110798636 
COG category[R] General function prediction only 
COG ID[COG4892] Predicted heme/steroid binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACAATT TTTTTGAAGA AAAGTTTGAG CAAATAAATA AATTGAAACA TTTAATGACT 
TTATCTAAAG GAAATAATCA AGCTAAAACA TATAAAGAAC AAATAGAAGA TATTTTTAAA
GAATTTCAAA GGGACTATAT AAGAGAAGAG AGTAATACTA TGCTTATTCT GACAAAGGAG
GAATTAGCAA ATTTTAATGG GGAAAATGGA AATCCAGCCT ATATTTCTAT AGATGGTATT
ATATATGATA TTTCAAATAT AGAATTGTTT AAACAAAGTC CTTATAACAG TTTAAAACTT
GGAAGTGATG TAACAGAGGC CTTTGATGAA TTAAATGATG GAGATGAATC TATATTAAGA
GATATTCCTA TGGTAGGGCT TTTAGCAGAA CCTGAGGAAG CTGAAATTAT GAATGGAGTG
GAAGAGTATT ACCCACAACA AAGACTAAAA ATAATGAGTT CAGAAGAACT TAAAAAATAT
AATGGAGAGA ATGGAAGTTT AGCATATGTA GCTATTGAAG GAATAGTTTA TGATATAACC
AATTTATCAG TTTTGGAGCT ATTTAATGGG ACAAAATTAA GATTAGGCTC TGATATTACT
GAAGAATTTA AAACTTATTA CAGAGGGGAT AAGGAGCTTT TAAAGGATGC TAAAGAAGTA
GCAATATTAC ATGATTTCAA TGAAAGTCAA AGGGGAAAAC ATATTGAACG CAATCTTAAG
GAGTTAACTT TAAAGGAATT ATCAAAATAT GATGGAAAAA ATGGGAACCC TGCTTATATA
GCAGTTAATG GCATAATATA CGATGTTACA AATGAAGCTG TTTTTAAAAA AAGTCCACAT
AATTCAGTGA ATTTAGGAGT TGATATTAAT AAAGAATTTA ATGGATGTCA TAATGCAGAT
GAAGGCGTAT TATCTAAGCT TCCTATAGTT GGAACTGTTA TGTTAAAAAA AGAGCCTTTA
GTGAAGGATA CGGGTTCTAA GGAATTTAAT ATAAATGAAC TTAGTAATTA TAATGGAAAG
AATGGAAAGC CACCTTATAT AGCAGTATTT GGGACAGTAT ATGATTTAAC TGATGTAGAT
AAATGGCAAG AAAAAAACAT AAAAGTAGGA TGTGATTTAA CTGGAGAATA TAAAGAGGTT
TATGGGAATG ATAAATCTCA TTTAAAAAAT TTAAAAGTTG CAGGGGTCTT AACTTGTAGT
TTAAATGATT ATTAA
 
Protein sequence
MYNFFEEKFE QINKLKHLMT LSKGNNQAKT YKEQIEDIFK EFQRDYIREE SNTMLILTKE 
ELANFNGENG NPAYISIDGI IYDISNIELF KQSPYNSLKL GSDVTEAFDE LNDGDESILR
DIPMVGLLAE PEEAEIMNGV EEYYPQQRLK IMSSEELKKY NGENGSLAYV AIEGIVYDIT
NLSVLELFNG TKLRLGSDIT EEFKTYYRGD KELLKDAKEV AILHDFNESQ RGKHIERNLK
ELTLKELSKY DGKNGNPAYI AVNGIIYDVT NEAVFKKSPH NSVNLGVDIN KEFNGCHNAD
EGVLSKLPIV GTVMLKKEPL VKDTGSKEFN INELSNYNGK NGKPPYIAVF GTVYDLTDVD
KWQEKNIKVG CDLTGEYKEV YGNDKSHLKN LKVAGVLTCS LNDY