Gene CPR_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1041 
Symbol 
ID4205306 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1186028 
End bp1187047 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content28% 
IMG OID642565598 
Producthypothetical protein 
Protein accessionYP_698364 
Protein GI110802119 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2706] 3-carboxymuconate cyclase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00230518 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAAT CTATTGCATA TATAGGTACT TACACAAATG GGGTCAGTAA AGGTATCTAT 
AGATTATGTC TAGATACTGC TAAAGGAAAT ATTGAAGATT TATCCGTAGT AGCTAGATTC
GGTAATCCTA CTTATTTGTG CATACATAAC AATAAACTAT ATACAGTAGG TAACCCAATT
TCACTAGATA CACTAGGTGG TGTTGCTTCT TATAATATAG AAGAAGATTA TTCATTGAAA
TTAACAGGGG CTAGTTTACT TCAAGGTAAA AAACCTTGTC ATATTAATAT AATCCCAGAT
AAATCCTTAA TAGTTTCTAG TAATTTTCAC GAAAAATCAA TTAATACATA TTCATTAAAT
GAGAATTTTG ATATAGATAC TTCATTAAGT GCATTTTCTC ATAAAGATGA CTCAAAAATG
CATTTTGCAT CAACAACTCC AGATAACAAA TTTATATGTG CTGTAAATTT AGGTATGGAT
AGAATAGAAC TTTTTAAAAT CAATTCTAAT AACACCTTAA GTTACATTGA AAATCTAAGT
TTTTATTGTA CTAAAGGATG CGGTCCAAGA CATATAGAAT TTTCAAAGAA TGGAAAGTTT
GCATATGTTA TATGTGAAAA TAGTTCTGAA ATAATTATAT TAAAATATTT AGGTGAAGAA
GGATTTAAAT TAGTTCAATA TCTTCATGTA CTTCCTAATG GCTTTGGAGG ACAAAATTTT
GGTTCTGCAA TAAAAATAAG TCCTTGTAAT AAATTCCTAT ACGTTTCTAA CAGAGGCTTT
AATGGAATAT CAGCCTTTAG AATAAATGAG GAAACTGGTT CTTTATCACT TATAAATCAC
TATAGTTCAC ATGGTGATTT CCCTAGGGAT TTTGAAATCA GTCCATGTAA TAAGTTCTTA
GTAATTGCAA ATGAAAAATC AGATAACCTA ACAATATATT TAAAAAATCC AGATGGAACA
CTAAAACTTT TTAAAGATGA TATATTTATT CCATCTCCTA CATGTATAAA ATTTAAATAG
 
Protein sequence
MNKSIAYIGT YTNGVSKGIY RLCLDTAKGN IEDLSVVARF GNPTYLCIHN NKLYTVGNPI 
SLDTLGGVAS YNIEEDYSLK LTGASLLQGK KPCHINIIPD KSLIVSSNFH EKSINTYSLN
ENFDIDTSLS AFSHKDDSKM HFASTTPDNK FICAVNLGMD RIELFKINSN NTLSYIENLS
FYCTKGCGPR HIEFSKNGKF AYVICENSSE IIILKYLGEE GFKLVQYLHV LPNGFGGQNF
GSAIKISPCN KFLYVSNRGF NGISAFRINE ETGSLSLINH YSSHGDFPRD FEISPCNKFL
VIANEKSDNL TIYLKNPDGT LKLFKDDIFI PSPTCIKFK