Gene CPF_0231 details

Gene Information

Locus tag: CPF_0231
Symbol:
ID: 4203456
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 280728
End bp: 282158
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 27%
IMG OID: 638081115
Product: hypothetical protein
Protein accession: YP_694693
Protein GI: 110801375
COG category:
COG ID:
TIGRFAM ID:
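
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information table.
START_BP = 280728
END_BP = 282158
GENE_LENGTH_BP = 1431
PROTEIN_LENGTH_AA = 476


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumption)."""
    return end - start + 1


def coding_residues(cds_length_bp: int) -> int:
    """Residues encoded by a CDS that ends in exactly one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_residues(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 282158 - 280728 + 1 gives 1431 bp, and 1431 / 3 - 1 gives 476 aa, which agrees with the table.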


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACAA TAAGTAACAT ATTAAAATCT TTTTTAACTC AAAATATTTT AGGGTATATT 
GGTGTTACAT TAATTTTAAT AGTTATATGC CTAGGATTTA TCTACAATAT GAGGGTTAAG
AAAAAGTATG AGAATCTTTT AAATGCTTTT AATGAGAGTG AGTGGATTAA TGTTTCAAAT
AATGAAGATC AAAAGGAACT TAAATTTCTT AATTCAGAGT TAAAGGCAAT GGCAGATGAT
TTTAGAAAGA GTGCTATTAA AGGAACATAT AATATTAATA CAGAGGTAAT AATTCAAAAG
AATATAGAAA AGAGAATATT AAAAGAAGAA AATATTGCAA ATATACTGCC ATCTATTTCA
ATAGCTTTAG GTCTTATAGG TACATTTTTA GGGCTTACTG TGGCTATTAT GACAACAACA
GGTATATTAT CCTCTGGAAC ACAGAGCATG GCTGATTTCT CAAGAAGTAT GAATATGCCT
CTAGGAAGTA TGTCTAGTGC CTTTTGGACA AGTATTGTAG GGGTAATAGG TTCTATCATC
TTAAATTGTC TAAACATTAA TTTAAAGAGA GCTAAAGAGG ATTTTTATGA TGTTTTTGAA
GACTATTTAG ATAACATATT ATTTTCTATT TATATAAAAT CAGATAAAGA TGTACTTACT
GATTTTATGA ATAAAGCTTT AGGAAGTTTT TCAATAAAGT TAGATGAATT ATTTAACAAG
GGAATAAATA ATTTAGTTGA AGGCATAAAT AAAAACACCG TAGATATGAC AAGTACCTTT
AATGGAATGA ATAAACATAT AAGTGATTTA GAGAATATTG TATCATACTT TAGAGTTTCA
GTTATGGGAA TAAAAACACC TATAAAAAGC TTTGGGGAAG ATTTAGAAAA GTTTAGTAGG
GTAAGTGCAG AGTTTTGTAA TTCTGTAAAT TCAACAAATA ATAAATTCCT TAATAAATCT
ACTAGACTTG AGGAGAGTGT AAATGAACTA TCAAAGGCTA TAAATGATAA TAAGAGAGAA
ATAAATAAGC TTTGTATGAG ATTAGAGGTT CAAAGTAATG AGGTGAGAAG AGGATATAAT
ACCTTTAATG ATGTCTTTAA TATTATAAAA GATGAGCAAA GCAGAAATAA TAAAATTATT
TCAAATCAAA TTTCAAAATT AAATGATGGA TATAGAGAAT TTGAAAGAGG AATAAATAAA
TTTTCAAGCA ATGTTTCAAA TATGAAAAAT GAAGTTGCAA ATGGTATATA CTATGCTCTT
CAAAGAGAAA CTCAAGAACT TTCATCAAAT ATAGTTGATA AATTAAATGT TCCTCTAAGA
GATATTAGCC TAGCAACTGA GGAGTTAAGT AGAAGTACTA ATAGGAATAG AGAAATTGGA
AAGGCAGATT CAGGGTGGAA AAAAGATAGT AAAGAGATTG GTAGGTGTTA A
 
Protein sequence
MSTISNILKS FLTQNILGYI GVTLILIVIC LGFIYNMRVK KKYENLLNAF NESEWINVSN 
NEDQKELKFL NSELKAMADD FRKSAIKGTY NINTEVIIQK NIEKRILKEE NIANILPSIS
IALGLIGTFL GLTVAIMTTT GILSSGTQSM ADFSRSMNMP LGSMSSAFWT SIVGVIGSII
LNCLNINLKR AKEDFYDVFE DYLDNILFSI YIKSDKDVLT DFMNKALGSF SIKLDELFNK
GINNLVEGIN KNTVDMTSTF NGMNKHISDL ENIVSYFRVS VMGIKTPIKS FGEDLEKFSR
VSAEFCNSVN STNNKFLNKS TRLEESVNEL SKAINDNKRE INKLCMRLEV QSNEVRRGYN
TFNDVFNIIK DEQSRNNKII SNQISKLNDG YREFERGINK FSSNVSNMKN EVANGIYYAL
QRETQELSSN IVDKLNVPLR DISLATEELS RSTNRNREIG KADSGWKKDS KEIGRC
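
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch under stated assumptions, not the database's own method: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard one, whose amino acid assignments translation table 11 shares. Alternative start codons, which table 11 also permits, are not handled here.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence on the page."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence listed above.
    first_line = "ATGAGTACAA TAAGTAACAT ATTAAAATCT TTTTTAACTC AAAATATTTT AGGGTATATT"
    print(translate(first_line))  # MSTISNILKSFLTQNILGYI
```

The 20 residues produced from the first 60 bases match the start of the protein sequence above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.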