Gene CPF_0341 details


Gene Information

Locus tag: CPF_0341
Symbol: uvrC
ID: 4202089
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium perfringens ATCC 13124
Kingdom: Bacteria
Replicon accession: NC_008261
Strand:
Start bp: 405519
End bp: 407381
Gene Length: 1863 bp
Protein Length: 620 aa
Translation table: 11
GC content: 32%
IMG OID: 638081228
Product: excinuclease ABC subunit C
Protein accession: YP_694801
Protein GI: 110800325
COG category: [L] Replication, recombination and repair
COG ID: [COG0322] Nuclease subunit of the excinuclease complex
TIGRFAM ID: [TIGR00194] excinuclease ABC, C subunit
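
The start, end, gene length, and protein length values above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the record: it assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon that is not translated."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(405519, 407381)
print(length)                     # 1863
print(protein_length_aa(length))  # 620
```
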


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTGATT TTCAACACCA ATTAAAGATA TTGCCTGATA AACCAGGAGT TTATATAATG 
AAAAATTCTC TTGGAGAAGT TATTTATGTA GGAAAAGCAA AGGTATTGAA AAATAGAGTA
AGGCAATATT TTCAAAACTC TAAGAATCAC TCTGAAAAAG TTAGAGCTAT GGTTAAGAAT
ATAGCTGAAT TTGAGTACAT AGTTACTGAT TCTGAAATGG AAGCATTAAT TTTAGAATGT
AATCTTATCA AGAAATACAG TCCAAGATAT AACATAGCCT TAAAGGATGA TAAGTTTTAT
CCCTTTATAA AAATAACCAC TAATGAGGAT TTTCCAAGGG TTTATGTTAC AAGGAATTTT
GCTAAGGATG GAAATAGATA TTTTGGACCA TATACCAATG GTACAGCCGT TTATGAGGTT
ATGGGGCTTA TAAAGAAGTT ATTCCCATTA AGAACCTGTA AAAAGGCCAT TGTGGAAGGT
GGAGAGCCTA CGAGAGCATG TTTAAATTAT CATATTAACC TTTGTAAAGC TCCTTGTGCT
GGCTATATAT CTAAGGCTGA GTACTGGGAA ATGATAGATG AAATTATAAA CATTTTAAAT
GGTACAGACA CATCCATAAT AAAAAAATTA AAATTAGAGA TGGAAAAAGC TGCAGAAGAA
TTAGAGTTTG AAAAAGCAGC TAAAATTAGA GATAGAATTT TGGCCATAGA ATTGATTAGC
GAAAAACAAA AAATGTTTAC TGTAAAAGAG GGCGATGAGG ATTTTATAGA CTTATATACT
GATGAAAAAG ATGGATGTGC TCAAGTTTTC TTTGTTAGAG AGGGAAAAGT TACAGGCAGA
GAGCACTTTA TGATTGAAAA TATTAGTGAT GATCCAGTTA AAGAGGTAAT AAGTTCCTTT
ATAGCATCCT TTTATGGAGG AACTGCACAA ATACCTAAGA CTATCTATGT GCCAGAGGAA
ATAGAGGATC AAGAGCTTAT AGAAAAGTTT CTTACAGAAA AAAGAGGATC TAAGGTTTGG
ATAAAAGTCC CTAAGAAGGG GGATAAAAAG AATCTTTTAG ATATGGTTAG AAATAATGCT
AAGATAATGC TTGATCAATT TAAAGAAAAA ATGGTTGAGG AAAAAGAGTT AAATAAGTCT
GCCTTAACTG AACTGGCAGA TGTTTTAGGA TTAGATTCTT TGCCTGCTAG AATAGAGGCT
TATGATATAT CCAATATACA GGGTGTAGAC TCAGTAGGAA CCATGGTAGT CTTTGAAAAT
GGAAAAGCAA AAAATTCAGA TTATAGGAGA TTTAAGATAA AAAGTGTTAA AGGTCCTAAT
GATTATGAAA GTATGAGAGA AATATTAAGC AGAAGATTTT CTCATGGATT AGAGGAAGTA
AATAAAATAA AAGAGAGAAA TTTAGAATAC TCAAAGGGAA AGTTTTGTAT TTTCCCAGAC
TTGATAATGA TGGATGGGGG AAAAGGGCAA GTAAACATAG CCTTAGAGGT TCTAAAGGAC
TTTGGTATAG AAATTCCTGT CTGTGGCCTT GTTAAAGACC ATAAACATAG AACTAGAGGA
ATTATATTCA ATAACGAAGA AATCCTCATT AGAAGGGGTT CAGGCCTTAT GAATCTAATA
ACTAGAGTTC AGGATGAGGT TCATAGATAT GCCATAACAT ATCATAGGAG TTTAAGAGAT
AAAAGAACCT TACATTCCAT ATTAGAAGAT ATACCTAGAA TTGGGGAAAA GAGAAGAAGA
AATCTTCTTA TGAAGTTTGG AAGTATAGAT AATATTAAAA AGGCATCTAT GGAAGAATTA
TTAGATACAC CAGGGATAGA CAAAAGAGCA GCAGAGAGTA TAAAACAATA TTTTTCAAGT
TAA
 
Protein sequence
MFDFQHQLKI LPDKPGVYIM KNSLGEVIYV GKAKVLKNRV RQYFQNSKNH SEKVRAMVKN 
IAEFEYIVTD SEMEALILEC NLIKKYSPRY NIALKDDKFY PFIKITTNED FPRVYVTRNF
AKDGNRYFGP YTNGTAVYEV MGLIKKLFPL RTCKKAIVEG GEPTRACLNY HINLCKAPCA
GYISKAEYWE MIDEIINILN GTDTSIIKKL KLEMEKAAEE LEFEKAAKIR DRILAIELIS
EKQKMFTVKE GDEDFIDLYT DEKDGCAQVF FVREGKVTGR EHFMIENISD DPVKEVISSF
IASFYGGTAQ IPKTIYVPEE IEDQELIEKF LTEKRGSKVW IKVPKKGDKK NLLDMVRNNA
KIMLDQFKEK MVEEKELNKS ALTELADVLG LDSLPARIEA YDISNIQGVD SVGTMVVFEN
GKAKNSDYRR FKIKSVKGPN DYESMREILS RRFSHGLEEV NKIKERNLEY SKGKFCIFPD
LIMMDGGKGQ VNIALEVLKD FGIEIPVCGL VKDHKHRTRG IIFNNEEILI RRGSGLMNLI
TRVQDEVHRY AITYHRSLRD KRTLHSILED IPRIGEKRRR NLLMKFGSID NIKKASMEEL
LDTPGIDKRA AESIKQYFSS
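
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below shows that relationship on the first 60 bases of the gene sequence. It is an illustration under stated assumptions, not the tool used to produce this record: the names `CODON_TABLE` and `translate` are hypothetical, and the codon-to-amino-acid assignments used are those shared by the standard code and table 11. Table 11 also allows alternative start codons, which this sketch does not handle; this gene starts with ATG, so that does not matter here.

```python
from itertools import product

# Codon table in TCAG order; these amino-acid assignments are shared by the
# standard genetic code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGTTTGATT TTCAACACCA ATTAAAGATA TTGCCTGATA AACCAGGAGT TTATATAATG"
print(translate(first_line))  # MFDFQHQLKILPDKPGVYIM
```

The printed residues match the first 20 residues of the protein sequence (MFDFQHQLKI LPDKPGVYIM).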