Gene CPF_1232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_1232 
Symbol 
ID4203363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp1400550 
End bp1401569 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content27% 
IMG OID638082113 
Producthypothetical protein 
Protein accessionYP_695678 
Protein GI110800508 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2706] 3-carboxymuconate cyclase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0395677 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAT CTATTGCATA TATAGGTACT TACACAAATG GGGCTAGTGA AGGTATCTAT 
AGATTATGTC TAGATACTGA TAAAGGAAAT ATTGAAGATT TATCCTTAGT AGCTAGATTT
GGTAATCCTA CTTATTTGTG CATACATAAC AATAAACTAT ATACAGTTGG GAACCCATTT
TCTCTAGATA CCCTAGGTGG TGTTGCTTCT TATAATATAG AAGAGGATTA TTCATTAAAA
TTAACAGGTG CTAGTTTACT TCAAGGTAAA AAACCTTGTC ATATTAATAT AATTGCAGAT
AAATCCTTAA TAGTTTCAAG TAATTATCAT GAAAAATCAA TTAATACTTA TTCATTAAAT
GAGAATTTTG ATATAGATAC TTCACTAAGT GCATTTTCTC ATAAAGATGA TTCAAAAATG
CATTTTGCAT CAATAACTCC AGATAATAAA TTTATATGTG CTGTAAATTT AGGTATGGAT
AGAATAGAAC TTTTTAAAAT TAATTCTAAT AATACCTTAA GTTACATTGA AAATCTAAGT
TTTTATTGTG CTAAAGGATG CGGTCCAAGG CATATAGAAT TTTCAAAGAA TGGAAAGTTT
GCATATGTTA TATGTGAAAA TAGTTCTGAA ATCATTATAT TAAAATATTT AGGAGAAGAA
GGATTTAAAT TAGTTCAGTA TATTCATGTA CTTCCTAATG GCTTTGGAGG ACAAAATTTT
GGTTCTGCAA TAAAAATAAG TCCTTGTAAT AAATTCCTAT ACGTTTCTAA CAGAGGCTTT
AATGGAATAT CAGCCTTTAG AATAAATGAA GAAACTGGTT CTTTATCACT TATAAATCAC
TATAGTTCAC ATGGTGATTT CCCTAGGGAT TTTGAAATCA GTCCATGTAA TAAGTTCTTA
ATCATTGCAA ATGAAAAATC AGATAACCTA ACAATTTATC TTAAAAATCC AGATGGAACA
CTAAAACTTT TAAAAAATGA TATATTTATT CCATCTCCTA CATGTATAAA GTTTAAATAG
 
Protein sequence
MNKSIAYIGT YTNGASEGIY RLCLDTDKGN IEDLSLVARF GNPTYLCIHN NKLYTVGNPF 
SLDTLGGVAS YNIEEDYSLK LTGASLLQGK KPCHINIIAD KSLIVSSNYH EKSINTYSLN
ENFDIDTSLS AFSHKDDSKM HFASITPDNK FICAVNLGMD RIELFKINSN NTLSYIENLS
FYCAKGCGPR HIEFSKNGKF AYVICENSSE IIILKYLGEE GFKLVQYIHV LPNGFGGQNF
GSAIKISPCN KFLYVSNRGF NGISAFRINE ETGSLSLINH YSSHGDFPRD FEISPCNKFL
IIANEKSDNL TIYLKNPDGT LKLLKNDIFI PSPTCIKFK