Gene CPF_0337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPF_0337 
SymboluvrA 
ID4202159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens ATCC 13124 
KingdomBacteria 
Replicon accessionNC_008261 
Strand
Start bp399068 
End bp401887 
Gene Length2820 bp 
Protein Length939 aa 
Translation table11 
GC content32% 
IMG OID638081224 
Productexcinuclease ABC subunit A 
Protein accessionYP_694797 
Protein GI110800317 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGATA AGATAATAGT AAAAGGTGCA AAAGTACATA ATTTAAAAAA TGTAAGTTTA 
GAAATTCCTA GAGATAAGCT TATTGTTTTT ACTGGACTTT CAGGTTCAGG TAAAAGTTCT
CTAGCTTTCG ATACAATCTA TGCAGAAGGA CAAAGAAGAT ATGTAGAATC TTTATCATCT
TATGCAAGAC AATTCTTAGG ACAGATGGAT AAGCCAGATG TTGAGTCCAT AGAAGGACTT
TCACCTGCTA TATCTATTGA CCAAAAGACA ACTTCTAGGA ATCCAAGATC TACTGTAGGG
ACAGTAACAG AAATATATGA TTATTTAAGA TTATTATATG CAAGAGTGGG AGTGCCACAT
TGTCCTAAGT GTGGAAAAGA AATCACTCAG CAATCTGTGG ATCAAATAGT AGATCAAATT
ATGGAATTGC CAGAGAGAAG TAAAATAATG ATCTTGGCTC CAATAATAAG AGGAAGAAAA
GGAACTCATG AAAAAGTTCT TGAAAATATT AAGAAGCAAG GTTTTGTTAG GGCTAGAATA
GATGGAGAAA TTTATGATTT AACAGAAGAT GAGATAAAAC TTGAAAAAAA TATAAAGCAT
AATATAGAAG CTGTAGTTGA CAGAATAATT GTTAAGGACG GAATAGAAGG TAGATTAACA
GACTCTATAG AAACATCTCT TAAAATGGCT GAAGGATTAG TTTTAGTTAA TATAATAGGG
GAGGAAGATA GACTTTATAG TGAGCATTTT GCTTGTGCTG ATTGTGGTAT AAGTATAGAT
GAACTTGCAC CAAGAATGTT CTCATTTAAC TCACCCTTTG GAAAATGTGA AAGATGTGAT
GGATTAGGAA CTTTAATGGA AATTGATGAG GATTTAGTGG TTCCTAATAA GGATTTAAGC
ATAAGAGGAG GAGCTATTTC TACTTGGGGA GACTCAAGAA TGAAGGAAGA ATCTTGGACT
TATTGCGTCC TTAAAGCTTT AATGGAAAAG TACAACTTTG ATTTAGACAC TCCATATAAG
GATTTACCTA AGAAGGTTCA AGAGGTTTTA ATGTATGGAG AGCCAGAAAA ATTAAAGGTT
ACATATACAA AAGAAAATGT AACGGCTGTA TATAATCATT CCTTTGAGGG GGAAATAAAC
AATTTAAGAA GAAGATATAT GGAAACTAAC TCAGATACCA TGAAGGCTGA AATAGAAAAA
TATATGAGTG ATAATCCATG TCCTAAATGT AAGGGTGCAA GGCTTAAGCC AGAAGCTTTA
GCCGTTACAG TGGGAGGAAA AAATATATTT GAATTCACAA GCATGGCTAT AAGAGAAGAG
TTAGATTTTA TAAACTCAAT AAATTTCTCA GAAAAAGATA AGATAATAAG TAGTCAAATT
ATAAAGGAAA TTCAATCTAG ATTAAGTTTC TTAATAAATG TTGGCTTAGA TTACTTAGAC
TTAGCTAGAA AAGCAGGAAC TTTATCTGGG GGAGAAGCTC AGAGAATAAG ACTGGCTACT
CAAATAGGTT CTCAACTTAT GGGAGTATTA TATATCTTAG ATGAGCCTTC AATAGGTCTG
CATCAAAGAG ATAATGATAG ACTTATATCC ACATTAAAAC AACTTAGAGA TGTGGGAAAT
ACCCTTATAG TAGTAGAACA TGATGAAGAT ACTATGAGAG AAGCAGATTA CATAGTTGAT
ATAGGGCCAG GAGCTGGAGA ACATGGTGGA AAGATAGTTG CCTCAGGAAC TTTAGATGAA
ATAATGTCAA ATGAAAATTC CTTAACTGGT AAGTATTTAA CTGGAGCTAA AAAGGTTGAG
CTTCCAGAGG AAAGAAGAAA AGGCAATGGA AATTTCATAA CAGTTAAGGG TGCTAAGGAA
AATAACTTAA AAAATGTTAC TGCTAAATTT CCTTTAGGAA CTTTAACTAT GGTTACTGGA
GTTTCAGGAT CAGGAAAGAG TACTTTAGTT AATGAAATTC TTTATAAAGG ATTAAATAAA
ATCGTAAATA AAGCTAAGGA TTTACCAGGA AAGTTTAAAG AAATAACAGG ATATGAAAAT
ATTGATAAGA TTATTGATAT AGATCAAAGT CCTATAGGAA GAACTCCAAG AAGTAATCCA
GCTACTTATA CTGGAACTTT TGATATAATA AGAGAGCTTT TCTCACAAAC TCAAGAAGCT
AAAATGAGAG GGTATAAACC AGGAAGATTT TCTTTCAATG TAAAGGGTGG AAGATGTGAA
GCTTGTAGTG GAGATGGAAT AATAAAGATA GAAATGCAGT TTTTATCTGA TGTTTATGTT
CCATGTGAAG TTTGTAAGGG AAAAAGATAT AATAGAGAGA CCTTAGAAGT TAAATATAAG
GGGAAAAATA TAGCCGATGT ATTAAACATG ACTGTTGAGG AGGCTTTAGA GTTCTTTGAA
AATATTCCAA GAATAAAAAA TAAGCTTCAA ACCTTAATGG ACGTTGGTTT AGGATATATA
AGATTAGGTC AACCTTCAAC TCAATTATCA GGTGGAGAAG CTCAAAGAAT TAAATTAGCT
TATGAATTAT CTAAGAGAAG TACAGGAAAA ACCTTATATA TCTTAGATGA ACCTACAACA
GGCCTTCATA TACACGATGT AAATAGACTT GTAAAAATAC TTCAAAGATT AGTTGATGGA
GGAAATACAG TAATAGTAAT AGAACATAAT TTAGATATGA TTAAATGTGC AGATTATATA
GTTGATTTAG GTCCAGAAGG CGGAGATAAG GGTGGAACTA TTATTGCCAC AGGAACTCCT
GAGAAAATAG CTGAGGCTAA GGAATCCTAT ACAGGTAAAT ATTTAAAGAA ATATCTTTAA
 
Protein sequence
MKDKIIVKGA KVHNLKNVSL EIPRDKLIVF TGLSGSGKSS LAFDTIYAEG QRRYVESLSS 
YARQFLGQMD KPDVESIEGL SPAISIDQKT TSRNPRSTVG TVTEIYDYLR LLYARVGVPH
CPKCGKEITQ QSVDQIVDQI MELPERSKIM ILAPIIRGRK GTHEKVLENI KKQGFVRARI
DGEIYDLTED EIKLEKNIKH NIEAVVDRII VKDGIEGRLT DSIETSLKMA EGLVLVNIIG
EEDRLYSEHF ACADCGISID ELAPRMFSFN SPFGKCERCD GLGTLMEIDE DLVVPNKDLS
IRGGAISTWG DSRMKEESWT YCVLKALMEK YNFDLDTPYK DLPKKVQEVL MYGEPEKLKV
TYTKENVTAV YNHSFEGEIN NLRRRYMETN SDTMKAEIEK YMSDNPCPKC KGARLKPEAL
AVTVGGKNIF EFTSMAIREE LDFINSINFS EKDKIISSQI IKEIQSRLSF LINVGLDYLD
LARKAGTLSG GEAQRIRLAT QIGSQLMGVL YILDEPSIGL HQRDNDRLIS TLKQLRDVGN
TLIVVEHDED TMREADYIVD IGPGAGEHGG KIVASGTLDE IMSNENSLTG KYLTGAKKVE
LPEERRKGNG NFITVKGAKE NNLKNVTAKF PLGTLTMVTG VSGSGKSTLV NEILYKGLNK
IVNKAKDLPG KFKEITGYEN IDKIIDIDQS PIGRTPRSNP ATYTGTFDII RELFSQTQEA
KMRGYKPGRF SFNVKGGRCE ACSGDGIIKI EMQFLSDVYV PCEVCKGKRY NRETLEVKYK
GKNIADVLNM TVEEALEFFE NIPRIKNKLQ TLMDVGLGYI RLGQPSTQLS GGEAQRIKLA
YELSKRSTGK TLYILDEPTT GLHIHDVNRL VKILQRLVDG GNTVIVIEHN LDMIKCADYI
VDLGPEGGDK GGTIIATGTP EKIAEAKESY TGKYLKKYL