Gene CPF_2006 details

Gene Information

Locus tag: CPF_2006
Symbol: spoIVA
ID: 4202595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 2245619
End bp: 2247094
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 31%
IMG OID: 638082875
Product: stage IV sporulation protein A
Protein accession: YP_696439
Protein GI: 110801043
COG category:
COG ID:
TIGRFAM ID: [TIGR02836] stage IV sporulation protein A
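
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below is a minimal consistency check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2245619, 2247094)
print(length)                     # 1476
print(protein_length_aa(length))  # 491
```

Both values agree with the Gene Length and Protein Length fields of this record.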


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAAGATT TCAATATATA CAAAGATATA GCAGAAAGAA CACAGGGAGA TATTTATGTT 
GGAGTTGTTG GGCCAGTAAG AACAGGTAAA TCTACATTTA TAAAAAGATT TATGGACTTA
ATGGTAATAC CTAAGATTGA TAATGCTTAT AAAAAGGAAA GAGCAAAGGA TGAATTACCA
CAAAGTGGAT CAGGAAAAAC TATTCATACA ACAGAGCCTA AATTTGTACC TAATGAGGCA
GTAGAAATTG CTTTAGATGA TGGCATTAAG TTTAGTGTGA GAATGGTTGA TTGTGTTGGA
TACATTGTTA AAGGGGCTAA TGGTTATTTT GATGATGGAG AATCTAAAAA AGTTCATACT
CCTTGGTTTG ACTATGAAAT TCCATTTGAA GATGCAGCGG AGATAGGAAC TAGAAAAGTA
ATAACAGACC ATTCAACAAT AGGATTAGTA GTTACTACAG ATGGAAGTAT AACTGGTATA
GATCGTGATG ATTACTTAGA TGCTGAAGAA AGAGTTGTAG CTGAGTTAAA ATCAATAGAC
AAACCTTTTA TAATTGTACT TAATTCATTA GATCCAAGGG CAGAAGAGAC TTTAGACTTA
AAACAAGAAT TAGAAATCAG ATATGGAGTT CCAGTTCAAA TAATGGATGT AGCCAATATG
AATGAAAATG ACATAAACGA TTTATTTACA AAAGTACTTA AAGAATTTCC AGTTAAGGAA
ATTAATATAG ACATGCCAAA ATGGATTGAA AAATTAGAGC CTTCTCATTG GTTAAAATCT
AATTTTATAG ACATAGTTAA AGACATGTGT AAAAACATAT CAAAAATTAG AGACGTTAAG
GATCTACTTA GTACTTATGG AGAAGATTTC TTAGGGGTAG CAGATATAAG TGAGATGAAT
TTAGGGGATG GAACTGTAAG AGTTAAAATG ACTCCTAAAA ATGGCATCTT CTATAAAATA
ATAAGTGAAA TGTGTGATGA AGAATTAAAT GATGAAAGTG ATTTAATAGC TTTAATCAAA
GATTTGCATA AAGCAAAATC TGAATATGAT AAGGTAGCTG AAGCAATAAA TAGTGTTAAG
GAAACAGGTT ATGGACTAGT TGCTCCTCAA TTATCAGAAA TGAAGTTTGA AAAACCAGAT
ATTGATAAGC AAGGTTCAAA ATATGTTGTT AAACTTAAGG CGAGTGCTCC TAGTCTACAT
TTAATAAAAG CAGATATTCA AACAGAAATT TGCCCAATAA TGGGAACTGA AAAAGAAACT
CAAGAGGTAT TTAAAACATT ACTTGAGCAA TTTGAAAGTG ATCCGGAAAA ATTATGGCAA
AGCAATATGT TTGGTAAGTC CTTAGAGACA TTAGTTCAAG AAGGTTTAAG AAGCAAACTT
TATAAGATGC CAGATGATAT TCAAAGCAAG ATTCAAAAAA CTCTTCAAAG AATCATCAAT
GAAGGGGAAG GAAATTTAAT CTGTATTATT TTCTAA
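
The GC content field of this record can be recomputed from the gene sequence. The sketch below is one simple way to do that, assuming the sequence is pasted as plain text in the 10-base blocks shown above. The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence; paste all lines to check the whole gene.
first_line = "TTGGAAGATT TCAATATATA CAAAGATATA GCAGAAAGAA CACAGGGAGA TATTTATGTT"
print(round(gc_percent(first_line), 1))
```

Running it on the complete sequence, rather than a single line, gives the figure to compare with the GC content reported in the record.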
 
Protein sequence
MEDFNIYKDI AERTQGDIYV GVVGPVRTGK STFIKRFMDL MVIPKIDNAY KKERAKDELP 
QSGSGKTIHT TEPKFVPNEA VEIALDDGIK FSVRMVDCVG YIVKGANGYF DDGESKKVHT
PWFDYEIPFE DAAEIGTRKV ITDHSTIGLV VTTDGSITGI DRDDYLDAEE RVVAELKSID
KPFIIVLNSL DPRAEETLDL KQELEIRYGV PVQIMDVANM NENDINDLFT KVLKEFPVKE
INIDMPKWIE KLEPSHWLKS NFIDIVKDMC KNISKIRDVK DLLSTYGEDF LGVADISEMN
LGDGTVRVKM TPKNGIFYKI ISEMCDEELN DESDLIALIK DLHKAKSEYD KVAEAINSVK
ETGYGLVAPQ LSEMKFEKPD IDKQGSKYVV KLKASAPSLH LIKADIQTEI CPIMGTEKET
QEVFKTLLEQ FESDPEKLWQ SNMFGKSLET LVQEGLRSKL YKMPDDIQSK IQKTLQRIIN
EGEGNLICII F
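
The protein sequence can be reproduced from the gene sequence with translation table 11 (bacterial, archaeal and plant plastid code). The gene starts with TTG, which table 11 allows as an initiation codon and which is read as methionine in that position. The sketch below is a minimal, dependency-free translator. The names `translate`, `CODON_TABLE` and `START_CODONS` are hypothetical, and the code assumes an unambiguous sequence given in frame on the coding strand.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons permitted by table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiation codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "TTGGAAGATT TCAATATATA CAAAGATATA GCAGAAAGAA CACAGGGAGA TATTTATGTT"
print(translate(first_line))  # MEDFNIYKDIAERTQGDIYV
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence should yield the complete 491-residue protein.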