Gene CPR_2043 details


Gene Information

Locus tag: CPR_2043
Symbol: uvsE
ID: 4204208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens SM101
Kingdom: Bacteria
Replicon accession: NC_008262
Strand:
Start bp: 2260576
End bp: 2261814
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 25%
IMG OID: 642566593
Product: putative UV damage endonuclease
Protein accession: YP_699352
Protein GI: 110801573
COG category: [L] Replication, recombination and repair
COG ID: [COG4294] UV damage repair endonuclease
TIGRFAM ID: [TIGR00629] UV damage endonuclease UvdE
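
The coordinate and length fields above are consistent with one another. A minimal Python sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and both endpoints are included.
start_bp = 2260576
end_bp = 2261814

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the final codon is the stop codon
# and does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1239 bp
print(protein_length, "aa")  # 412 aa
```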


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAG GATATGCTTG TACTCCAATA ACTACAAATG CTAGAACAAA TCGCAGAATA 
TTACTAAAGG ATTTTTCTAA AGATAAATTC CTTTCTATTG CAAATCAAAA TTTAGATGAT
TTTAAAAAAA TATTAGAATG GAATATTAAA AATAATATTT ACTTATTTAG AATAGGATCA
GATATTATCC CCTTAGGTTC CCATGAAATA AATAATATAA GCTGGCAAAA AGAATTTAAA
GATGAATTAG AAACCATAGG AACTTTTATT AAGAATAATA AAATTAGAGT TTCTATGCAT
CCTGGTCAAT ATACAGTAAT AAATACCCCT AAAGAGGATG TGTTATACAA ATCTATAAAG
GATATTGAGT ATCACTGTGA GTTTTTAGAC TCCTTAAATG TAGATTATAA AAATAAAATA
ATACTTCATA TAGGAGGGGT ATATGGCGAT AAAAAATTAG CAAAAGAAAA CTTTTTAAAG
GGATTTAAAA AGCTTTCGGA TTCTTCTAAA AAGCGATTAG TCATAGAAAA TGATGAGAGA
AATTTTTCTC TAGATGATGT TTTAGATATT TCTAGTAAAT TAAATATTCC TGTTATATTT
GATAATTTAC ATAATATATG CTATGGGGAT AATTCTTATA GTTTAAAAGA AATTTACTCA
CTTGTTATTA AAACATGGAA TAAAGAGTTA GATGGAAATA TGAAAGTACA TTATAGTGAG
CAGGACATTT TTAAAAAGAA AGGGTCCCAC TCTCCTTCAA TTTCTATAAA TAGTTTTTTA
GAATATTATG AGGAGGTTAA AGAGTATTCT CCTGACATAA TGTTAGAAGT TAAGGATAAG
GATATCTCTG CAATTAAGTG CATAAACTCC TTAAAAGAAA TAAATAAAAC ACTAAACTCT
AAAGCTTATA GGGAAGAGAT AGAAAATTAT AAATTACTTT TGCTTCAACA TGATAAAGAC
TTTCAGAAAA AGCTTAACTC CTTTTCTAAA GGTTTAATTG AATTTTATAA TTATTTAGAT
AAATTATTAC TATCTCCTAA GGAGATAATA GGATTTAAAT ACTCTTTAGA ACTAGCTTTT
AATATTTTAA AAGATCACAT AAGTAATAGA GAAAGCCTAT ATTTTAAAAA GCTCATTAAT
GAAAAAGAAT ATGAAAAAGC TAAAGTTTAT TTAACAAAGT TAGTAAAGAA AATAGAATTT
CCCCCTAAGG AATTATCTTA TTATATTTCT CAGTCTTAA
 
Protein sequence
MKIGYACTPI TTNARTNRRI LLKDFSKDKF LSIANQNLDD FKKILEWNIK NNIYLFRIGS 
DIIPLGSHEI NNISWQKEFK DELETIGTFI KNNKIRVSMH PGQYTVINTP KEDVLYKSIK
DIEYHCEFLD SLNVDYKNKI ILHIGGVYGD KKLAKENFLK GFKKLSDSSK KRLVIENDER
NFSLDDVLDI SSKLNIPVIF DNLHNICYGD NSYSLKEIYS LVIKTWNKEL DGNMKVHYSE
QDIFKKKGSH SPSISINSFL EYYEEVKEYS PDIMLEVKDK DISAIKCINS LKEINKTLNS
KAYREEIENY KLLLLQHDKD FQKKLNSFSK GLIEFYNYLD KLLLSPKEII GFKYSLELAF
NILKDHISNR ESLYFKKLIN EKEYEKAKVY LTKLVKKIEF PPKELSYYIS QS
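
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be removed before any computation. The following Python sketch shows one way to do that, and how the GC fraction and the translation could be checked against the record. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built on the assumption that translation table 11 assigns sense codons the same amino acids as the standard code; its alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
# Assumption: sense-codon assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the listing above.
first_blocks = "ATGAAAATAG GATATGCTTG TACTCCAATA"
print(translate(first_blocks))  # MKIGYACTPI
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be compared with the computed values.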